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#M2 003-329115 ^--J ; y 

im&m i ] 

4Afi>s*?M, NS4B NS5A^>/t^jrxo«NS5B^^/'C^fr*3- Ki"4tt3IE 

< & 1 oo»R7- u -mm-JJt U *°- * -Mc^. £r>^& < t 1 OOIR 
[fif*3l3] 

K^I#^-9X*il 0^$*t*3a*S^iJ^?,^S5'*afi««ii:. 4>&< fcfe loot 
^ v - * - L < f 4 U sjf - * - jtfS^ , IRESSe^iJ t s MUJ-M2a<D C MJFFifc: 7 -f 
;i/X<7>ry ARNA_h<7)NS3* W^JC, NS4A* >v?^fC. NS4B* NS5A*W\^ff 

ao«NS5B^wf^K*3-K-r*ttafeK2?iifc, ie^ij#-^i ixm 2-emzti&v&mm 
mu?-m2a<D c mm-&* -f^w arna^\ w&m^ 3 Xf4 5 a^m^ie^j 

*»<b*«RNA-C*4, WftRl ~3<7)^irtifrimzm.<DU7V 3>RNA 0 

<£> (a) X (4 (b) ORNAri* fe44l/^3 >RNA 0 

(a) E^!l#^-13lt±2T^$astt3IE5iJ^?>35r*RNA 

(b) @B?iJ#^-13l{4 2t?^$tL^m^E^Jtci3v*-ci~l OlHO^Ig^^ S#IXt4tt 
f»*S 1 ~ 5 OV^-TtL^ l JRRftoi^y a >-RNA£#fflJ&K2SA-f £ £ 4 t) ffcj^$ 

*««Ufed*, Huh7» N HepG2ai&> IMY-N9«, HeLaiTO. 2UF293ifBJJ&* 

*) man* ^-ftifr<DmMT-$> % , m^m 1 mm.<o i/^ja ^tt«ama„ 

[BMSg 10] 

5 ov^-f tifr 1 ^ie^6D^-7 s y a >RNA 0 

1 1 ] 

9 o^-ftifr 1 mmvu-yv 3 ^^« Q 

1 2 ] 

iHfcOPyy 3>rnAo 
1 3 ] 

1 4 ] 

W^6~9 0v>-f^l^|H«OVyy 3>^aHfl&*^ vyj 3>RNA£»ffl1-£ 

MisJ-M2a<7)CMM-^^ ^Ol/'/'J 3 >RKAO«**fe 0 
[»t«15] 

fflf*3i 6-9 OV>r^ 1 SBHOl/yj 3 >«lMSfflJ&£i«U *L«Jg*tt*& 

miE#2 0 03-3 1077 8 3 
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[St l 6 ] 

1 7 ] 

Ixy; 3 >««Mfflfl&S:^«-r4XSS: 1 BJBLbfif d - t iUKT-Stea© C iUfrifc * 

[IS^a l 8 ] 

[» 1 9 ] 

feiRV-'/Va ^RNA£IR#-r S £ t &#tr % % JBW***:*: L 7t«fe : f-M2 
aO C ISffFsfe * 4 i\> 7s <7) v 7* u 3 j/RNA* £ 
[ft 2 0 ] 

C MfFife <D Vzf V 3 >RNA£ Wt-t %> -ft&o 

[St*«2 1] 

se^m-^- i tt4aafEyii±jc^ v>r, ito (a) - (u) : 

(a) iM£#^7 1 5 T^gM&K&lt&A^ibG^O^^ 

(b) J^#-5§-4 9 5 5 <DU$Lte&lf&CfrbU^<DW.^ 

(c) ^#-^-4 9 3 6 Og&ft K:j3tf& A^G^tf)^?^ 

(d) ^&l=-3§-5 0 0 0OgM£^;feUt&A^ibG^O^^ 

(e) &3fe#4§-7 2 8 8 OgRl&fc&tt&A^ibG^O^^ 

(f) ^#-^-5 9 0 1 0^iCi3tt^)G^^U^.O^ N 

(g) ^*#^-6 1 1 3 <DU%Lfc&W2>kfrb\}^<D^m, 

(h) &&&^r2 8 9 0C9^i^*3tt^)A^^G^<D^^ 

(i) ^ft#-^6 8 2 6<9gM£t;:i3tt<5>C^£>A^?>i^ 
(j) ^^#-^-6 8 8 7 (DU&fc&lf&Cfrhh^tD^m, 
(k) ^^#-^-6 5 8 0 0$$$Ll,Z$5lf2>\]frbA^<D^m. 
(1) &&#-^7 1 5 9 <DU®-fci$ft2>\ifrbC^<D^m^ 
(m) aafef^- 7 2 3 0 <DU$L \Z & if & h A^'D&m, 
(n) ^^#-^-6 9 4 3 33 tt£C^£>A^<^^ 
(o) ^^#-^-5 6 8 7 <DU&fc&li&GfrbA^<D$i.g;, 
(p) m*#-^-6 1 1 CKDU&tetelf&AfrbG^Vgcm, 
(q) 5 5 5 0 (DUtiLK&tf &\}frbQ^<D^m, 
(r) ^#-^-7 2 1 7 <DU%L\Z&tf&AfrhG^<D^^ 
(s) S63£#-5-3 6 4 3 <F>U%Lfc&tf%>AfrbG^<D^m, 

(t) J&^#-5§-5 8 5 1 <DU$LlZt3tf&GfrbA^<D$zm^ RV 1 
(u) ^#-^-5 9 1 4 (VU&fc&if&GfribA^^m, 

*)M$lZti2>&%< t i> loo^tr^Tf *3£l&I2«^&£ vVy 3>RN 

. mH#2 0 0 3 -3 1 0 7 7 8 3 
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^ST-^aOCMJffifc'^VT. (HCV) VJ Afi^O^m^^tr^m*^ 

[0 0 0 1] 
I0 0 0 2J 

CSfFi&rW (Hepatitis C virus, HCV) i±, "7 9 fcftf >f ^T^K^-f^ — ^il 
oC+JSH-fe^RNAfc^V Aif^^^f CSffF^OJIH h £ t tf&bfrx 

$ ti& -1 i: tffrfr-o X § fc 0 mft^T?^ &HCV3tf£^M<7>£lli&T-& £ , Simmonds b K 

j:^hcv m<vmmw.m*m^tz&m.mmmx^t hcv fiit^Mia, mhf-mik aus^ 

352a, mttM2b, itfcT-M3a, itf5M3b^ 6 9 4 7*1' (^ttXtl) > $ 
bK^Kb^&Z 4 -fifi^ < ofr<D*)-7?4 yi:5-i$tLTV^ 0 gi^ET-fi, HCVO^ 

2-5) o 
[0 0 0 3] 

JfrifeSrH^U -e©HI!l2 0%©lf^l 0^-2 0^tgtfF»^1rL> $ b \Z 
[0 0 0 4] 

•f&o ^^-7xn^^S» HCV^m^Sat^l-^^h^^ttT^!). 

Gfc#iraifc6) o 
[0 0 0 5] 

xm@^*5^T«m^iS5<. m^^^ij^^^^^. fr^m.&tem®r&m&tf& 
y YtLxn, Hw<Dmmwm*mv<Dnmm$k<Dmu^z-btLZ> 0 

[0 0 0 6] 

ttz, KQ^Km^m^nmsm^mmit^y^y^-L^^^iztzib^ hcv^mi 
m^mmmm<Dm^mmx^>^rzo L^Ltat^ot, iicm*<7>&mmmz^-t 
zimtLx, ncv^r^y AWkuyv xytf^ztttzztizz *) mw?Xffi2, 
M7~i 0) , &mmn&*m^xHW(vmmmmzMffiir&£ttfvrifet%<>fz 0 z 

tlb<DHCVy-7<r; ARNAV 7*<j 3 >f± N itfc^lUbOHCV^V ARNAOS'^m^tf^HC 

v iREsoT»i^^fti-^)«it^ >'*?k*. i/ym&mttx.vt^vTmi.zMigi 

LtzEMCV IKESl,z£-oXmikLtz1s(DX$>&o COMA]/?') 3 fli, ^ hJff^«Huh7^ 
fALT^tv^fi/ >^Tt^tt -5 £ t «fc t) , Huh7#IBJEl*!-C gflMSi£-f £ £ t tf*gE 

[0 0 0 7]' 

L^L&# ? <b. iOJ:^ ^HCVO»|*iRNA«^(i. it^HlbOHCVO^y ARNA 
=Srfflv^ci^L^M^^Tv^v> 0 ^^^,itfg^-SOHCVt?{±3- K£ft£ -7>f ;i/7^ 

ffi.lEff 2 0 0 3-3 1 Q7.783 
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■7° £&{?1-#lHCV^ * H^M" * i t Itm \zm L #x. <b o 
[0 0 0 8] 

WfrMvfclK 1 ] <&m 2 0 0 2 - 1 7 1 9 7 
[#fp^m2] #PB2 0 0 1 - 1 7 1 8 7^<2r^ 

[*#H3tfU] Simmonds, P. et al, Hepatology, (1994) 10, p. 1321-1324 
[*#M : Xltt2] Choo et al., Science, (1989) 244, p. 359-362 
[*4MWOR3] Kato et al., J. Med. Virol., (2001) 64(3) p. 334-339 
teMMRtiU] Okamoto, H et al, J. Gen. Virol., (1992) 73 p. 673-679 
[WWWf~XMi5] Mori, S. et al, Biochem. Biophis. Res. Commun. , (1992) 183, 
p. 334-342 

[3¥WWf-$M6] Yoshioka et al.. Hepatology, (1992) 16(2): p. 293-299 

Lohmann et al.. Science, (1999) 285, p. 110-113 
[3&fcWF£Wl8] Blight et al., Science, (2000) 290, p. 1972-1974 
[#M£H\fc!t 9 ] Friebe et al., J. Virol., (2001) 75(24): p. 12047-12057 
0] Ikeda et al., J. Virol., (2002) 76(6): p. 2997-3006 

mmm®: l x o t -r & mm] 

[0 0 0 91 

[0010] 
[0 0 11] 

[1] mfS^-M2aOCMfl : i^ I >-r;V^O^y ARNA±0, NS3^^^^K 
> NS4A* >r«*Jt> NS4B*W??K> NS5A* W?* JtRtfNSSB* V/** JC£ 3 - Kf 
*@E^1I t > 3' INBfim t < t & ^tr h & h „ I//'J3 >RNAo 

3 VRNAti, < fc & 1 -ftfe^Xfi U #~ * -Mitt, RV&tk 

< t i> 1 OOIRESE^I** feU'&tri Lv> 0 

[2] @2^iJ#-t9X(±l OOV^^lo-e^^ttSttlfeE^fe^SS'^liflR^tttt. 

^<tu u -itfc^pg l < a u #- * t % iREsie^o t > ans^ 

H2a<DC5gJH : |fe*>f /W*0$V ARNA±<7>NS3* NS4A*W?*Wx NS4B* 

S> NS5A*>'r?:5'|OlO ? NS5B*' 3- Ki" &^SB?"Jfc. ETdH^ 1 lXlil 2 

OV>Wlo^S*i* 3&&@B?!) ?, * * 3 * ^flKft ^ i: tr Jfi*B5?!] £ * & , 
V7*'J 3>RNA 0 

[3] atfe^ltfa© C Mflfifc RNA#\ 3 X l± 5 T?^ $ *t a dft^E 

?!]#><b&&RNAtr£>&, ±fB[l]Xtt[2]E*OVy; a^RNAo 
[4] »TO (a) X J± (b) ORNA#» S> & £ V y y 3 >RNAo 

(a) E?U#-^lXt±2^§*t*ttafeE^|36 k fe«f*RNA 

(b) @B^iJ#-^ 1 Xii 2 -T?3F$*L;&aUfeB?!|fci3V>T 1 - 1 0fl4>:otffeP4c£> M&Xtitt 

[5] ±.E[i]-[4]iB«ov^^oi^yy 3 vm*«Bift^»xi-&t fcJCj: *)fls»$;K, 
* ' . ' miE#2 003-31 0. 7783 
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M&36»S*«BJftT?**Ci:3& t J:l}jeF*L<, Huh7», HepG2,«. IMY-N9M. HeLa 
M> &U f 293*fflJE^<b&3S£ £ ^aWixiv^i'tt^OjWIllfe-CsbSifc^^feHieFt Lv> 

o 

[6] cmw?£L*'()\'xm%iOTi3ffi%mL< awmomst* xi±mffiotnt><D, ±ib[i} 

~ [4] IfittO !✓ 7 U a >RNA G 
V a VRNAo 

[10] ±|B[5]fB«OV^V 3>if»^i/yij a>RNA£Slffli-&£i:*'&'tN jSM5 
[11] _tiS[5]!B«£c9I'7 Pl J 3>ti||fttKftU nhtiztg^tfofrbVJ 

[12] tUMMlOffftTT*, ±IB[5]|B«OV/y a^«Hifit««l^ #<b*t&J£* 

[13] ±ia[5]ffi^Ok^U 3 VtUl^^t^V^'J 3 >RNA«#U JDtf#Lfc8UR 
m * ft*-f & It Sr 1 HI \>l±ft 9 £ * * tr „ «fis?-S2a<& C M J§fi£ ^w^vyjn 

[14] _L|B[5]|B«tOVyj 3>^|Sl^^C,^$aV7 p V3>'RNA?rflJl#L, JRftbfcHS 
J6*^«t4ie* 1 EUJJUbfrd S^f ^i:#?,^l/yv 3>«»^?,« 

[15] ±m[un^n<D-^mzx^xmm.^fiti^mm^m±Ltiiy^^ s^rnakov* 

*L^3ft«*Xt±7 5 y®fI^iAtSi i: Sr^tr. Wig* L fcitfeT-M 2a<£> 

C aiFFife -J'f^Ol/^js >RNA£ »jt"t & 

[16] ie^ij#-^l T*^$tL-S^@e^!|±^i5V>T^ JKTF<0(a)~(u): 

(a) m&#-!§-7 1 5 7 0*BftK:*>t*A*»?>G^OgE^ 

(b) ^#-^4 9 5 5<0<»ifcfc:J3»t*C#»&U^<Z>3e*x 

(c) 4 9 3 6 0£M£Ki3tf£A^ibG^<7>^, 

(d) ^#-^-5 0 0 0O«ifcfc:mt3A*&G^oag». 

(e) mm^7 2 8 8 0»fi:»c*»t*A**feG^oaE*^ 

(f) m^#-^-5 901 o«sfi:^*tt*G^?>u^oag*, 

(g) m^#^"6 1 1 3<3»ifcfc;mt3A*ibU^©a£!k 

(h) m»^"2 8 9 0<DMmzi$W2>kfrtoG^<O&m, 

(i) ^#-^-6 8 2 6^SBttfc:l3»t&Ca»'bA^03e#^ 
(j) ^&#^-6 8 8 7 <D&mz$$lf2>CfrbA^<D&^ 
(k) m^#-i"6 5 8 0O«RffiH*»t*U*»feA^O3BII, 
(1) *£&#^7 1 5 9 0«Pfir^^tt^>U3&^C^O^> 
(m) ffi3MI^-7 2 3.O©«Bftfc:mt3Uj&*&A'M03E*> 
(n) ^*#-^6 9 4 3O«ffcfc*5»t4Cj&»fcA'M0gE*. 

/ miiE# 2 0 0 3 - 3 1 0 7 7 8 3 
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(o) *S*#-^5 6 8 7 <OUtiLK&lf&GfrbK^<7>%Z^ 

(p) ^m^6 1 1 O^SISfcK&tt&A^G^CD^, 

(q) U&^5 5 5 0<D^m-i$VZ>\)frhC^(D^m, 

(r) ^#-^7 2 1 7«SKti:^(t^A*t)G^O^ 

(s) ^#-^3 6 4 3 <?>TO:&-j3&t&A^<bG^O^, 

(t) ^#-^-5 8 5 1 ^teKislt^G^A^O^II, 

(u) Mff 5 9 1 4 0^(C^^-&G^^A^O^, 

A 0 

[3PJ3 <£>$»:£] 
[0 0 12] 

») , itf5^M2a<^HCV^^fi5|5-tJZ,HCV-RNAP7 P '; 3 >^a6T^$ tL^ 0 
#f&H;§K#& 1^7°'; n >«»«BJfeJ±. it^M2aOHCVfi^ORNA25.(y $ HCV^ >/^«*}$ 

[fPJ£»1-££#OgJi<DiJSt] 
[0 0 13] 

[0 0 14] 

1 . #f£im~#3HCVft5fc<Dl/7 P V a >RNA 
CSWHfe'W/l'* (HCV) <or7Alt &9600** V**- K*6fc* #fi|RNA 
Z.<0¥ J ARNA{±> 5'^MHK5(tt (5'NTRX{±5'UTRt > 
^4K&««ft2:^&4lJft$iL«fl«^«4t. RO'S'MKil* (3*NTRX&i3'UTR2: k^fE^^ 

) frhta&o *<om&mmz\*wi<Dm&9 Y%tix&t), nmm.rn.mz 

»±ttSfeogNd*^ >^>St&«3- K§*itv»4 0 
[0 0 15] 

J: 9 fcHCVOflfct* Wmfm.fr b-Wi*<r># V 7° 

Core, El, 2StOT2) fc*«at^ (NS2„ NS3, NS4A, NS4B, NS5A. &tfNS5B) fcrt* 

ii*>v?*K (i-^t>*), HCV07-f ;v^? >v?^R) cD?^, Corel* 3 

ElS^ti^V^n-y^ >^°^®-e* 19, &mM?>^?M (NS2. NS3, NS4A> 

NS4B, NS5A, RWNS5B) i± ? ^ g #0««iJCH#1-* * >;^lt^^ o NS2{i ^ * 

nyntT-« NS3f±-fe V ^^0x7 — tfffitt(N5|aMi 03^0 1 ) fc^O 

(C*jSffl«3^2) £fca*£n 6 *iTv*S 0 J fcNS4A NS3 ©ynf 7- 

•e?§1$K*J"f £ 37r**--e*lK NS5BW:RNA#i#RNA**V ^ tffiHt*^-*-* £ t & 

1-TK*-&s*trv»* (§rai) o 

[0 0 16] 
[0 0 17] 

*9J*n#-e n > i **rLT:*5»)Hcv*>f/K**VA& l rft Si $ ntz rna * 

S*L*S»tS««l*^ri-4RNA*. itte^M2aOHCVfi*<7)V7 P '; 3^RNAfc«;1-*o #93 
«B#^*JV*THCVfi*Olxy«; =r>RNA{±, HCV-RNAV7*V 3 ^ £ 
[0 0 18] 

*S&9Jfci5V>-T\ r*^ffl2a©C!BffF3fctf>f /P*J Htfc^I^aOHCVj ti\ Sinmo 

- ffiSE#2 003-3107783 
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cv$v arna&^-t^ wefc < . 3e&fi**>Hcv*v ^wM^A^m^m^M 

jLitrs AWhi^i-r&V 4 Zo a^lB2aOHCVO*#« Utt, JFH-1 
tt&tfJCH-lfl: (trai) ^tf>*^/l/*5&*^f$>;H&o 
[0 0 19] 

Sfefc r^S : f-M2aOCMJR : ^:^^;V^<7)^/ ARNAJ W\ &{^ll2atf>CMJfH&'r 
;v * (D—^m. <£> (+) Ift-t y * RNA** h%£%>y / A \Zfrtz2> £ & RNA 

[0 0 2 0] 

*I«itK*V>-C, r5'^»fWBW (5'NTR3ZJ45'UTR) J „ IHS3* JC. NS4A 

?y;*?M, NS4B*V?^®> NS5A* R&tfNSSB* Wf*a- K-«E?!|J . 
[Core*ys*?W**- K-r*E?!l (Core«JftX{4C«*S) J n rEl* R& 3 - K 

(ei«j«) J , rE2* ys*?n* 3 - K-ra E?"J (£2«*ft) j > TN2* 

ta-Kt»ISN (NS2«J*) J „ rNS3*>VN-*@£3- K1-£E?!l (NS3«#) J , TN 
S4A* ys*?M* 3- K1"£ E#] (NS4A«) J , rNS4B* 3- Kt*E3?fl (N 

S4B«l*) J , rNS5A* W^R&s- Ki"*E^J (NS5A«J«) J > rNS5B* RSr 
a- K-t *HB^lJ (NS5Bfgi$) J „ W r3'MlRffii^ (3'NTRX{43'UTR) J „ &mz^<D 
4fetf># j£tf>«Hfc& L < i4TO(4> jtfcT^aOHCVC** JFH-l^O^y A^««* a - K 
•t^^rftcDNA (JFH-l^n-v) 0&&E?!] (@2^J#-^3) ^S*tU, gi6l)^t 
1"&o @B^lJ*-t3 0m^lS?!|{4. BI^DNAt'-^^^^ (DDBJ/EMBL/GenBank) frhT**. 
y v a y ##AB047639 kz£l) ^#pTI g T* & & 0 K f4 . WM&^r 3 t?^ £ ft S :&^E 

?!) fc*fr L T#5eOHCV?>RNAE?!] *77^>^>Mfci:i^ @B^J#^- 3 t?^ $ *l& 
E^I±0*afe*-^ 1 ~340£7 9 > ^ > h $ fLSE^flrt^ORNA© T5' ^tBIRISMJ > B 
^#■^-3431-9442^ T 9 >f >^ > h %ft&W&W TNS3* NS4A* V/^flU N 

S4B* >v?^|C> NS5A^^^^5t2StO ! NS6B^>/'«^ff4:3- Ki-SEJIIJ ^ |W|tS*#-^343 

i~5323KT74y*yvztt&mmi> t ri&3*y^?^*x-Y-t2>mni . ^*#-f-53 

24- 5485 7 9 > * > b $tl5EW TNS4A^ R * 3 - ^> E^llJ . 

-^■5486~6268lwT7-f >^ > h $ji4S5S# TNS4B^ ^/t^RSrn- K1-*E^!IJ . TO 
»-^6269~7666tC T^-f>>>h$ it 4 E^ll** TNS5A* y*? ® * a - Ki" S E^JJ s 
«3lf*#7667~9442^ 75^ >p< > h £*l£E?iJ^ TNS5B^ R^r 3 - K-ffcEflU 

% P!ttai#-f-9443~9678tC T9^>^>h$tt^ E^J^ S T3' ^^iR^itJ "C* S o * , 

T*>4V> 0 $f>tw±|B«0 T#^OHCVj t4, ISjg-T & fcO-Cfi&V^ JFH-1*^ L < t±JC 
ft h (Om^^ *> Z tf 4 )V &o 
[0 0 2 1 ] 

*$&W K# S HCV RNA- 1/^'J3>0-00 Hife^H 14 , 5tfgiFS2a<?5 C MM$k 4 fl> X 
O^VARNA±<7X 5'#^i^ ! Biitt^ NS3^>/n-^K. NS4A^>^'^^ NS4B^^/^^R. 
NS5A*>v^lC;j5.U?NS5B*>^R&3- K"t*E2?!)i:, 3' MsR^ith < 
tfJt36E^I^"E>5:*. W'^U a ^RNATfcSo £OW7*V 3 ^RNA{4, ^4 < 1 ooS 
^v- * -jffeT-^ L < {4 y ^ - jt^, »^«; < b i> 1 ooiRESE^J^r $ K-g- 

^•e^)4^ 0 $ e.^«^<7)^7 p v 3>rna(4, mttM22i<DCMffr$kv << fr?s<Dy ; arna± 

<7)> NS3, NS4A, NS4B. NSSAS.y'NSSB^ >/n*^R^<7)^ W ^7s 9 >;*?W* 3- Fi-.2> 

E?!l£> -^^•e^)4v^ 0 

[0 0 2 2] 

*I&W^*HCV RNA- l/yj 3>og!|^)$fit^^»S{4. ^Jff 9Xf± 1 0"C^$ 

ftzm.mw.mfrt>* zy&mmmi&t^ io©«iRv-*-»^^rL<ttu 

# - ^ - JUS^ t > IRESE^iJ t s «fei^S2aO C fiffF^f -f V>? J ARNA±ONS3* > 
/^°^R^ NS4A^>/-?^R> NS4B* R N NS5A^ R^.t^NS5B^ K^r 3 - K 

ffiSE#2 0 03-3107783 



#112 0 0 3-3 2 9 1 1 5 ^-v I 6/ 

1-3ie?y^ EBI*#1 1 2 ft* *££E^*&fc S3* *^tr«[ 

ftSe^iJ^ & £ , l/^ja >RNAt?$> « o - £ •CBB^J#-t 9 1 0 -Cjf $ *l* &£E7!) 
{±>. ^*L-r*U ;£3£gl3K^£ V/U 3>RNA-e^-2>rSGREP-JFHl (E#I3H§- 1 ) RtfrSGREP 

-jchi (ss^ij#-^2) o5**fliiR«J«oie5!l-e*So *fcE?u#^i lscri 2ttf$*l 

*ffi*E^iJii % -ea-TtL. 3>RNA-C&&rSGEEP-JFHl (E3W-£ 1 ) 

StfrSGREP-JCHl (E?!l*H§-2) ©S'^afrattOEBI-O**,, 
[0 0 2 3] 

#f&Wtef&*HCV RNA- V "7* y 3 LVM oo||lii3g»tt, U(i 

2 T^£*t£:^E^J^'b&£RNA^ib&& 1/y V a ^RNAT^^o ££>K, iO@B?Jf^ 
lX(i2*C^£;h.5tt2£Ey>Jfc;fev»T\ 1-501®, 1-301®, 1-1010, 1-61®. 1-|&1@(2 
-51®) «mXti#jjP $ tltiiMMmmfr ^JP^Ja >RNAT* o T, tf» 

iza^x rs#*«fl&**r-rsj tii, wry a>m£jfflfl&fHci»XL;fcfc#fc N -eo 

U-^y a >RNA«ftT^© V/'J a >RNAi#^I^Hf^ i t^fl ^ ^ t 
*1"£o K5c-r*iOT*{±5rV>^ Z(0&®m$mi±, WHS. U-rv a >RNA£Huh7#B 
Mt:h7^7x^y 3 >U Huh7«£ig*U *i£^%*<9|fflJj&^<bWiii L 
fcRNAtCOV^T, SALfcvyy 3>RNA^#m^«i±i'Brtg^7 p n-r^ffiv^c:y-^f ^ 

t|Bm$*L7tan--^|goa!l^ > HCV* >'*?M<D§&wmm. n VRNAO^m# 

[0 0 2 4] 

* -> > lf^-C7T>f$/V »teflt^?*J&**Mf t, ft* l-t^y^ jitttit 

1zrt\ ^5 v > * 7" - -If atlsT- £ L <U M&mtttf $ S> jfr * L v> 0 

is t *#&9§ k is it & suk^ - ij - mm* n £ *l «b t~ $ ti & & <d x « * v> 0 

[0 0 2 5] 

*fc?-> M>IJi&i7Wj7^^77^-fe- (SEAP) fflK^&mfbft&o IS L*|fc 
W £ It & y ** - 9 - fltig-T- !4i^e»C|»Jg$ti.S<>©-C(±35fcV> 0 
[0 0 2 6] 

[0 0 2 7] 

*l&WH*»t* riRESE^JJ fctt^ RNAOrtffi^y ^y-A*iB^S*TaiR*H|j&$ 

mtLXlt* &Tlzmfe-tZ> feO-Cti^V^«EMCV IRES (IS'M&ifc* ^ ;i/XOF«3g|5y #V 
-AjB-frfflSffi) , FMDV IRES. HCV IRES^^lf t> fL*** v EMCV IRES^lfflCV IRES^«t 1? 
§f 4 L < % EMCV IRESjJ**fe»* L^o 
[0 0 2 8] 

$ h iz^Wiz&z v/y a >rna(4, nfeoHCv^X{4lfeostiE^-acDHCv^ry arna± 

ffliE# 2003-3107783 



#M 2003-329115 



^-v : 7/ 



ttLtli, HCV-H, HC-JU HCT-18. H77, DK-7s US1K S14, HCT23, HCV- 

Ths DR1, DR4, HCT27. S18, SWK DK9 N H9(k TD-6EU S9, HCV-BK, T10. DK1> HC-J4, HC 
V-J. HK3, HK8. HK5> HCV-G3. IND5. IND8, PHh DK D3, SW2. T3. S45. SA10. US6. HC 
V-JK1, HCV-JK4. HCV-JK3. HCV-JK2, HCV-JT, HC-J2, HCV-T, HK4, HC-G9. Zl> Bi.S.I. 
, Cho.J.M., HCV-J6. T4. T9. US10, HC-J5, T2. HC-J7. DK1K SW3. DK8, T8, HC-J8> S 
83. HK2. HC-J6. HC-J8. BEBE1, HCV-J6, HCV-J8. HD10-2, BR36-9. S52, S54, S2, BR33 
-1. HKl(h DK12. HCV-TR> BA-1, BA-2. DK13. Zl, Z4. Z6, Z7. HK2, SAK SA4, SA5, SA 
7, SA13. SA6, NZL1, SA30\ EG-13, HCV-K3a/65(k ED43. EUH1480. EUHK2, Th580. VN235 
s VN405. VN004, JK049. JK046, JFH-1, JCH-1, JCH-2, JCH-3. JCH-4. JCH-5, JCH-6, J 
6CF, H77^* W h ft h i)K Zjxhiz R jg-t & i> <D X it & v> 0 
[0 0 2 9] 

*$kWfc&2>UzfV 3>RNAt±, jifiKli, ^t5'<PJj4CjtfS?-M2a<OHCV^^7 ARNA 
±.<D5'Wmmmi&*. ftfcS'flUcafc^I^aOHCVO^y ARNAJi<^3'#NS!IRfi^£^1-& 

ft, NS4A* W^I, NS4B* NS5A* 
ft£n- K-r&@S?!)J 0±^X«T»li;M$ttT^ iv^L, TNS3^>/^K> NS4A* 
W^f, NS4B*W^®> NS5A* >/^®&m<S5B* K-t&I£?!)J 

[0 0 3 0] 

^Ci-SV/'J ^ >RNAJi> i«HL< it, ftfcS'ftimm^^aCOHCVOry A 
mJiOS'^N^RWc&^U reai 19 kTsfLKil^-* t < it U * - 
itfS^t, IRESIB^Jfc. rNS3*W^Jf N NS4A*>/^«. NS4B*>/^«, NS5A* > 

fS : fl2a©HCVOy / ARNA_h03'#MIR^^*-r^>o 
[0 0 3 1 ] 

^MrtT»^$^fcV>fi^O^M^^^tfRNA^^-A,T'i £v> 0 ^stfST-fis 5' 

±^XtiT^^^$*T'i> iv^L> rNS3^ >/N°^Ks NS4A^>^^K, NS4B^>-^^ 

NS5A^ V^n^K^U^NSSB^ b'-f ^.Sfi^JJ OJtaXttTSEfcaUSLT & 

iV^L. TNS3^>^^R, NS4A^>/^K, NS4B^^^K. NS5A* 

K-TSE^JJ 0ti:#ALtU^ 0 fl-Sfrtfc^&^tf V^y a >RNA 

[0 0 3 2] 

-ettj: I)3 , fflOIRESEy!iaO r rNS3^^^JJ, NS4A^>-/^^K. NS4B^>^n^H 
, NS5A^>^^K3lO ! NS5B^ W?^»*3-K-t*E3?!lJ i: ^> <t 9 V 

[0 0 3 3] 

^5^230 CMJTF^^-f ;V^O^y ARNA_LOtf a- 

^ i^yy 3>RNA^<biELv^^#-eiaiR$tt*«j: ; 5 K&sssatSo ^-ttiboss^jcod 

, NS4A^^^^S, NS4B^>/^H, NS5A^V^^RR0 t NS5B^>^^Jt*3-^i-4E 

fcBSE#2 0 0 3-3 1 0 7 7 8 3 



#12 0 0 3-3 2 9 1 1 5 v : 8/ 

W frhWilRZfrZ>*°') * Z"i9ftt b fefclfrfr* >sifiMb LX%m2 "?v 

[0 0 3 4] 

KftffXi^ti^o RJ&t* fcO-Cli&vv^ HCV RNA-V/'J 3^ti. «x.tfJ!3L 
T60 X ? **ft-Cf^«-r Sit § S o 
[0 0 3 5] 

^RNAyn^E-^-OTiEKilMU DNA^ o - v fcffctWfco faNA *D 

NAJ ti±, ij^RNAO&^SB^JOU (^v^V) &T >0 t:f l3fe£fcME?lJ*t 

1-<SDNA&Sti*-r3o mffiRNAyn^e- ^9X5 n - ttS & <DX$) 
ZZtamtL^o RNA^n-^— $ — b LTIi, R5e1"& & tf>-ei±ft T7 RNAT'n^ 

I0 0 3 6] 

i$ (CoreIE?iJ, BIHEflU E2SE?!]) ;&tfNS2* >/<*Jt*a- K1-£IE?!j&> S^-*- 
itfc^ L < \±\stf—9 — it^T- 1 * ©Tift £JMg L IRESIB^J b £ -gftr DNAHlrJt Uio 

[0 0 3 7] 

£ o RNA£-j£ tt , 5' £MB9?&tt;& tfIRESB2F!l h „ ^rft J: t) fflfe £ -fr & £ t &X # & „ 

- ©Tift*:: ABr $ *l Jt ±IBDNAlg «c £ M Rg^Si K X o XW «5 ffi t T , Z ©DNAglr Jt * #31 fc 
UTfflv»-CRNA*^^*ifc*T?#a 0 ft*, Sff L< l±^$^i>RNA03'*^ ? ^-f 
;V**V ARNA03**«R«i*i:-*U^ i<^E5iJWJn$ ft* >9 HlfcStL*: Lft v>£ 
Hff L^o KLT-£-j££*L*:RNA^ #5&Wfc#3 3>RNA-C&£o 

[0 0 3 8] 

3 . flft^^aWHCV© V7 V n ^RNA£3*A UV^'Ja 5"ttSifflJ&©fEW 
3gEAi-&£fcteJ: ^ *>mLi!ffitiLmzW&Zi%X\t*&mi&*:&&Zt#X'%Z 
J »1"£o 

[0 0 3 9] 

l/^'ja >RNA£*A Itli, i fcrtW«&ft«-e£> tufted 

^irsftjfcMfl&H xti t c twx ^mtL<, mzmik. He P G2 

M> IMY-N9;jfflJ&. HeLafflJft, ;Rtf293iWlfe*<b4&#J: ^11^1. v>f ^omt* 
[0 0 4 0] 
[0 0 4 1 ] 

C ffiSE#2 0 0 3-3 1 0 7 7 8 3 



#M 2 0 0 3-3 2 9 1 1 5 ^- v I 9/ 

tf^fe. 'J#7x^y3^ U >Bfe*/l'i/SA8u W^n^f >*J^?i/3 DEAE 
[0 0 4 2] 

^:^^AltUv^ 0 ^A^&RNASSr-^LTl':/'; 3 >WA<DM;Z$tfkmttz 
^Wi&^te, EM<V\s-7V 3>RNA£, &«^fbffl£B Lfc h - * ;vMt±RNAUE 

f b< {±10^3^^-10^ ^D^7A(Oi^fmo 
[0 0 4 3] 

-«HT, V7*V rr^RNA^^^-v-f yMWimtttimiR^- * -Mitt t Lt^i 

3>RNA^fflv^r^BJjaF'g*A^SL/c«^^*T ? ^ 7->it: 

»SU 16~24B#^i$#b7t#iH. Jg«7*4 * v^-KG418 (**W~>>) £0.05$ V ^ 

>; v ^ h^~3.05 v ^7A/< v v ? h;i/?>jftj£-e«]U torn. MK2m^ 

»»£&L&^<big*&Ku »»^<b$fiL<{iioB^~40QK. i^fji< 

(±14 a F5^*L^^^ V * * ;w*-f tvyh -e^M^^-fei-^ ^ h iZX *) 
[0 0 4 4] 

RNA^^S<J«I^$tLTV^«^n->^, #I»#-Ct± ri^^J3«^n 
[0 0 4 5] 

#&L£M^ n-^i:ov>tfi, SA£ ixtz 3 ^RNA^kl&ffflJE^ n- >$>x~ 
MMZtiX^Z 3^RNA<«ai. aiAStt/zV/V 3>RNA*Ojl^-7J-itfKT- 

[0 0 4 6] 

iff A $ *tfc v 7 s y a >rna^ e> %*mife 9 n - > * $ ttfc v y; 3 ^rna (#93$* 

*t*{± n j^TMfi^ic, m&u-fv s^rnaj tfo-fz) o^ma. mm^^n^<D 

[0 0 4 7] 

3: fc#A$ V^Ja >RNA*Oiii^v- # -st^TXti y * -a«gT-0«±^* 

Hi|iE# 2003-310 7 783 
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z>^t &x # & o 

[0 0 4 8] 

CV* ^ * ®Jfi;#:^|tm-r & £ t J: o Tfi 1 a £ t &X # & „ ffllfe * n - > L 

£HCV* >/t^Jl3Wftffi**L*Ltf, ^ll^n-yii, HCVffi*<OVyi; 
=r >-RNAt5^^6<7 K^gjl L THCV* > rc ? S £3§3| L T V> & *> <D t %M 1" £ £ t &X § & 0 
[0 0 4 9] 

ymmnm, a > hcv * > / < * K £ a # Wig ic^ffl net 3*-e § 

**e>, mcv?>s*?Mtfctt.*m^?ztkmmi,z£ ^^^^^^f^i^cus; 

o 

[0 0 5 0] 

MKmmLx^&m&izit^ t^uyv^y^mmmm^m^x^m^K^- k$*l& 

[0 0 5 1] 

K# £ V 7 y a ^ ttSUOJfe * v> Ttttl $ ti£M &titz vyj =1 J'RNA (%» V 
7*y n>RNA) tti^**fa±$*;&3&&3E#*#«*fc&? , *o to^^M 

4^itI©fA SriEJi-r -Sit # & o 
[0 0 5 2] 

1 ^SI/T'V 3>RNA&$ ?>^gl|Oiffl^^|f^AbT^2^V7 p y 
tikv^IS*, lHJSLt, £P£ L < til—lOlak «£ L < »±1~5BL $£>K$f£ 
L< «l~2IsIR«fi«J^fi 1 9 £ kiz* 9> l-yy3>^»*T% l^yy 3>RNAtc«» 

[0 0 5 3] 

tt«U^y 3>RNA^^A^^mt LTti. tt*©iWll&*ffl^*ifcj&fc§*^ 
**FKV^y 3 VRNA$-iA L/cIJ| 1 1^ C^tft 4 ^ t L < > * 

*jtcv^y 3>m**AL^«i:|wii:^@ft^<oiwicffi^fi^om-e*^i 

mSE#2 00 3-3107783 
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[0 0 5 4 ] 

yy aJ'RNAfcMltB^teJ: >9Jfc#U $ ?>»ciO» 1 0«ttV/U a >RNA* $ P>^S0OjW 

00. J: «9£f3: L< lil~5l3. $ <b L < ttl~2HR«fi&fc*To]fcfk iOR«Xg 

TMt £ i t «t <9 > ttftfcb*****:*: U^'J3 >RNA* * C i: S & o 
[0 0 5 5] 

0 * L< til0~100l&. «fc L< (ilOO-lOOOOiWtCii^c^^-SClh^-C^^o 

[0 0 5 6] 

sb^ij £ . ^^HBaiwSx Hfcv^ja >wA<D&m&m t itm-r zztizx*), m& 

>RNAKa- KS*tfc*/f * >/^||?>T $ S Z&m&U&ifift* Lv> 

o 

[0 0 5 7] 

yy a^RNAK#&K j: ^xniiCiot, L fc*^Sl2aO C SI 

IFF 3^ ->>f^Ol/"/ya^RNA* » jt f & £ ^ § & 2Fffi *> ftifc-t £ o 
[0 0 5 8] 

[0 0 5 9] 

^(^SU^'J a>RNA60m»^«. S*#K&ftJO*ife»c * 
-e£&7& ? . |f|x.lf^J: 1 tttefrOtorVkfe-flXtf «fcv> 0 fctx(^Huh7« ^0.0001. 
0.0003. 0.001. 0.003. 0.01. 0.03. 0.1. 0.3. 1.0^"f * n ^ A<DM<D 1>?V a VRNA 
ih7^7x^i/ 3 ^LT, maom»¥ttfcR»W^"^G418KJ:*S^iS«=S:2 1 
0Plfio^:3n--®ift (3D--S) &«05e1-&o ^AL^RNAgfcan^-^ 
^ich^Jt«LT#*«65^ao--^^liini-S a^RNA^A*OHH*ft 
SeU ^«rtW3D--MS*> ^Ab7tRNA*-e^LT#ib^S'ft^. lv 

an--ijft$ [(Colony forming Unit; CFU)/v On/5A] = 3Dx-||| [i® 
[0 0 6 0] 

L "C#ai $ a n * . fALfc l/^'J a >RNA«Mj^£^-f -fit*: 

I'T'y a >RNAc7)^»^i±. gftjftfcans- lfflJbfcOO*AUl^^'J a ^RNA 

<^atf-^r-e^$tt-&3n--Mt|-eSi-i t t-ci^ 0 «t 9 ^fHf 

30--^M=iALfcW^'J a VRNAOatf-gc [aii-]/30;-MI [«] 
[0 0 6 1] 

5. *i&Mo#ontt»a8 

tBtE# 2003-3107783 
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* ¥05e1- £ - t X *) „ C M JfN£ * -f )V7,<Dn%t*femX\±WM-? 2><®)M* * 9 V - - 

o 

[0 0 6 2] 

* * « O i£r # <*> * <0 ?&^J ^ L < li^Bf #J O «3tX imm * g & i: "t & £ t x. <b 
[0 0 6 3] 

(1) jt^M2aOHCVc7)ii5g^^J-r^>%K<7)^ 

m^nMt Ltli, ^Sfl5 K5 ? 'f vXli^'f — 

(3) m^M2aOHCVlC«L^M#0^#0^i6(7). *r3l3cMt«J©ffi5e 

(4) hcv? w ^^o*ffl*j-wi-a»tt^ffi^ffHBRim»tt^iwfe*its*«)ra5fe 

(7) m{z¥M2amaizn^zy9*>kLx&mwm%y9r>$cMo&i& 
immm) 

[0 0 6 4] 

a* wttH « £ ti h mum k rj£ $ *l* *> o i± & „ 

[0 0 6 5] 

mmm 1 ] u^;a ^rnaoass 
(a) s&m^? ?-<dw^ 
mmjR&<ommfrbfrmLfccmiRifc*4 frxm-m («^f-M2a) j 

A&mmzW&i-Zim*, R^^^ftW A^cDNAfc-^trJFH-l* n - > tf> ib JJX# 

i:Ltii$^y7X5 KDNA*. J£lT\ pJFHlt^-T^o R^K. W^Sfi^ 
^•JBILfeCiaUffife^^ (it^S2a) tf> * >f A^HMlDt &DNA 

5 KDNA&. pJCHlt^-t-So ^*5> ±|BJFH-1^ o- VRIFJCH-^ n- V Of^®^ 

mSE# 2003-3107783 
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AOffiafeEyiJ fi, HBUDNAr*- 9*V9 (DDBJ/EMBL/GenBank) 07H'r>3 : ABO 

47639K, JCH-1 ?u-y O^rgcDNAO^ftlB^J ti T * -fe y v a : AB047640K:^^£ 

[0 0 6 6] 

ioi? fcLTflMJSLfc?^;*? FDNA pJFHl&tfpJCHlOflfcfi*. 13 1 <0±.WttZ7£-* 0 
l"T7j {±T7 RNA^n^e-^-, TGJ M\ JfXLfeJFH-lXttJCH-lft*DNAO 5 'SftO± 

5> T3'NTRJ JWCIJifli^^^O^ A^^^JS-r-SDNAT^So 
[0 0 6 7] 

^Ct*. 7° 9 .X 5 KDNA pJFHl£ tfpJCHl t gftfl itftft O - » £ > itv^yV 
WffitW (neo ; **^jis>*X* hy>X7 ^?-l£&[fc : ?-k1>>1fa-f&) J&umW- 
IRES (/fito fl& jfc 7 ;v * <D ftU *) # V- A «f ^ffi) "CflMft LT S KDNA pSGREP 

-JFHlXOfpSGREP-JCHl^-e^-P^LWiSUfe (HIOTS) o £*>**#JRIi, gfcffc (*# 
M7) tcflfeofco ZT^X 5 KpJFHlRtl f pJCHl*«IR*3SfAgeIRtl f ClaI"C 

WU -tOSUBrSBffifc. pJFH-ia3HWNTR^e>Core«J*K*5J:^EyytpRSV5NEOft* 
O * * v J -> > WttitiS^ t * PCRii (B K J: 0 jtS-S- L «0 EII£StAgeI Pmel t*m WLtzWiK 

[0 0 6 8] 

ttz, pSGREP-JFHlt»<^NS5B«*(i^ov»Tx ttfUAiZ a - K$ *l£RNA# V ^ 9 --tfOM 
tt+^tffi^-J- /t ; Ef-7GDD^GND^I$*-5«ll^iALT, ^ 
H 7° 7 * 5 K^n - > P SGREP-JFH1/GND ML^» 
[0 0 6 9] 

£ £ % pSGREP-JFHl 4> <7?NS5B«it 3 - K 3 ft £ RNA*° U ^ 9 - -tf <£> 

ffitt+^^ffi^i-*T5 ygft^f— 7GDD*^tp5gttL7t 1 0 75 SW$ffl*%.$<:Z'& 
IT, ^tI"/7^ U'^n- > P SGREP-JFHl/dGDD£{« L fc 0 

[0 0 7 0] 

**±IBt?f^» L fc^SEH * n - > P SGREP-JFHl/GNDS.tFpSGREP-JFHl/dGDD{i, 

[0 0 7 1] 

(B) aJ'KNAOftMR 

9 -pSGREP-JFHl> pSGREP-JCHl, pSGREP-JFHl/GND. pSGREP-JFHl/dGDD£, ^tt-FfUWRa 
^XbalT?«3WfL7to 
[0 0 7 2] 

&v*-e N itt^oxbai-ejBfffiffro^^L-rtt^ov^T^ 10-20 /<g£ 50 ^lo^iSf k^- 

Mung Bean Nuclease 20 U £fflv*T 30X^3035* [8M >3r j.^? — h-f&Z. 1 1tZ£ t) 
, ZbizymLtio Mung Bean Nuclease^ Z^HSD-NAif?) — ^■^k^^^M^^JK^-M't 
*RJ6Srj»jaii-*»a5t?**o SIHT, Jt|EXbaI^»fif>T-*^0^$»Mi:Lrfflv^-rRNA 
^JSfcfi'd t . XbalOBKE^UO— »-C**CUGAO 4 Mm&3 ' $ii%K&frKl?to\Ztifz 
V?V a^RNAj&i-g-JfcSfLTL* 9o ffC^fll-Ctt, XbaI1TO®f>f£Mung Bean Nuc 
lease-C^LS-r^C: fcfc«fc <K XbaI«J»f»fM-3&*?>CUGAO 4 bfco iOft, Xbal 

i^BJf®r#£^trMung Bean lfaclease*fcS&0»tfcteov*"t\ * 
**SsSKJ:lj % CUGA©4«afe**8Sf*$tLfcXbaI«JllfWfM-«:*ll»t'C^ £*L&i#S!!DNAt L 
tz 0 

[0 0 7 3] 

iO#MDNA^«b. T7 RNA*°U ^ 7--b*£/BV*TRNA£in vitro£-j£ Lfco <1 ORN 
A^H{±Ambion*fcOMEGAscript=Srffli/>3t 0 #JMDNA^r0.5~1.0^-f ^ A«&trRJSifc 
20^ l*«aMt#©«ffl«W»»c|£oTEJ6$*^o 
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[0 0 7 4] 

RNA^jfcfcT&x RBmmzDNsise (2U) SrjgJD LT37"C-?15$HHRJfc$-fr7!:|fe, $ 
&K*tt7*>'-/WlJ:*RHAaiai*fToT, «FSDNA*Bfe*Lfc 0 iO± ? LTpSGREP 
-JFHl, pSGREP-JCHl, pSGREP-JFHl/GMK pSGREP-JFHl/dGDDt'fi^-f <2>_h^O#|g!DNA^ <b 
£-JjfcLfcRNA (V/Ua>RNA) -e^-T^rSGREP-JFHl, rSGREP-JCHl, rSGREP-JFHl/GN 
D, rSGREP-JFHl/dGDD t Ltz a Z-fihOV?*)* >RNAcD^@e?iJ £ , rSGREP-JFHl (C 
O^XitW&m^ !RV l m2^ rSGREP-JCHlKov>Tl±BB^J#4!-2;K.a f |S3, rSGREP-JFHl 
/GND&zov>Tl±@5»-?|- 7 , rSGREP-JFHl/dGDDKov>T{iIB?!)#-S§- 8 K^-To 
[0 0 7 5] 

[H;S&#!I2] 1/7"'; 3>fia^n->« 

(C) ls7 } ) 3 > RNA<£> F7V^7i^'>3 ^ A Y^^xy 3l?zs3>MJ&<D3U~- : J&$, 
m <D « Jg, 5DU^D-> CD#3t 

±EO^il/7 , y 3>RNA (rSGREP-JFHl. rSGREP-JCHl, rSGREP-JFHl/GND, rSGREP-JFH 
1/dGDD) <D-£*L-?*;ft.Kov>T, ftiOl/ 7°U 3 >RNA^Huh7»^ fbttft L/c h 
;V^ttRNAi:^tT, RNAi^g^l 0 ,u gt & & «!: 9 KiH^ L£ 0 ^V>-e>g-?>7l^RNA£ 

o£Huh73fflfl&£ig*T'-< r/ai;fiU 1 6B3W><b2 4 Btf^it* L^K. *g#T*-f 
•^va.tcG418 (**W>>) ^^^^jtK-C^PL7to -€-^>m. 1C2 ®mmWL*$i& 

^RNAM 1 ^ gig tz 0 K# <b ti£ n n ^-^n-jf L£ 0 
[0 0 7 6] 

£ tt7trSGREP-JFHlX«±rSGREP-JCHl l>7^7x^y 3 >^H^tCO 
v^Tti, _tfBcDig«2 1 U&<DtgmT<< v is ^frbZhK&.ftmm<Vxv 

[0 0 7 7] 
[0 0 7 8] 

(D) #h7>^7x^y 3 >^^Hj3tt-&^n^-^^ 
±IB<Dh7>X7x^>'3>^I x h7>^7x^V3 ^t7tl^7 P V 3 ^RNA 1/^g 

^ 7t O 3 n - -J&j&m*, rSGREP-JFHl & F7>^7x^y 3 > L7 v wHuh7#BJ|&-Ci3\ G41 
8M^ S 1.0 mg/mlO^. 94700 CFU (Colony Forming Unit; 3nx-|Mfi)//ig-R 
MX*$>otz (H4<7>7i^!j) o ^ratt, rSGREP-JFHl/dGDD, rSGREP- JFH1/GND te^fl 
^tih7>X7 ^? V3 > L£Huh7*fflJfe^{i, =?n^-^^M*i)'btL^^o7t (i4<?D 
^^JW^iJ) o £ co £ t ti, rSGREP-JFHl vy; n >RNA£ F7>X7x^-> 3 ^U 
Huh7H8)3&c0 3n--^ti{±, rSGREP-JFHl^fb#£5L£;ft,&NS5B (RNA# U ^ 9 - -If ) 
ttlC«LTV^^ i:£^1- 0 o|;tK 3ni-«|Lfc»t(i, rSGREP-JFHl^ib 
l&SS KZ>KS5B<Dltfz b%K£V) rSGREP-JFHl V7*'J3 >RNA#*g#1»1-& itCiot 
, v^Wtt^^^^^l&^^tLG^Slttt^il^^^^^*, «li3t^nr 

m~ & o £ & <d t h ntz 0 

[0 0 7 9] 

— rSGREP-JCHl ^7^7x^v 3 >L tzHuh7fflMXl±, G418it^n~0. 5mg/ml 
<Dm&fcitx u --fcf&fimtb btlttfr^tz (|H5) o G418itJS^0.25mg/mHCT^^ 
It. rSGREP-JCHl ^f7^7x^y 3 >L 7tHuh7«T» & a n =. <b *t 

fco 

[0 0 8 0] 

tH|E# 2003-3107783 
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£ K , ±IB ( B ) T?# <b ft fcf§5i^ ^ * -pSGREP-JFHl OXbal© irKfit £Mung Bean Nuc 
lease-C^S-^-f tCRNA^O^MDNAt Ltlv^t V 7° V 3 ^RNA^^Jft U Ztl*±f&( 
C) IWI^tc L-CHuh7^atc h 7 7 jc. ^ v a ^ L/wo Mung Bean Nuclease^gl^^f t> 

[0 0 8 1] 

<T60^^. Mung Bean NucleaseT^ai-gr-f KfeWLtzVfV 3 >RNA& h7>^7x^ 
v a > LfcHuh7ifflJ&© 3 n --^j£fBfci\ 512 CFu7> g • RNAKI&T Ltz (HI 6 0£« 0 

[0 0 8 2] 

[&ttff3] 

(E) 1/ 7* V ^ 1/ 7 'J 3 >RNAOff b 7 >^ 7 x ^ y a > 

2 Kfl£o TrSGREP-JFH14>Huh7M^C9 h^y^i^Vg^lCJ:*) L£ V 

[0 0 8 3] 

S—fVTUy YMfrit^ Molecular Cloning, A laboratory Manual, 2 nd edition, J 
. Sambrook, E.F. Fritsch, T. Maniatisl\ Cold Spring Harbor Laboratory Press (19 
89) OiB*Kfi£oTfro;fco W&fc ?>atBL fcRNA* 7 # n - * m$C&»> U & 
»»T&fc®RNA*sKS>T-'r - n VJRfclE^Lfco pSGREP-JFHl*<bf^KL7t 
32P7 L fcDNAS tz (iRNA^n - 7*£ N ffim<D t is 1) HtHE^ t fcRNAfctt ^f7*V^ 

7*'; n^Hft&RNA/^ K&^tfj Lfco 
[0 0 8 4] 

V7*'J =J^RNA<O i 5eJ:fi&RT-PCR^«tS^ffif± > Takeuchi T, Katsume A, Tanaka T, Abe 
A, Inoue K, Tsukiyama-Kohara K, Kawaguchi R, Tanaka S, Kohara M. Real-Time de 
tection system for quantification of Hepatitis C virus genome. Gastroenterology 
116: 636-642 (1999) fcfi£v»HCV Wk<D5*&MWtmm<DBWi*titm't&Z t iz £ t) £>Z% ^ 
tzo ^-ftftnc fi, mMfrbmLtzWAK&tti& VfV 3>RNA£> R6 
-130-S17, 5' -CGGGAGAGCCATAGTGG-3 ' (Efl|«Hg- 13). R6-290-R19, 5' -AGTACCACAAG 
GCCTTTCG-3' (@B?!)#^ 14), TaqMan Probe, R6-148-S21FT, 5' -CTGCGGAACCGGTGAGTA 
CAC-3' (Xmttl 5) tEZ rTth RNA PCR kit£Mv>TPCR*ti|>§U &V»T?ABI Prism 7 
700 sequence detector system^ X *5$&ttJL7t 0 
[0 0 8 5] 

3tf-^^^o*§, l x 1 0 7 3f-(?)V7>j a >RNA£"g-trJ: 9 KWiULfco h:?>*7 
* ? v 3 > £_h|B(C) t WlUO^X^\ ^v»TG418»JK3^1aB/mlO*#T-ca^i&?aS 
Lfcfci^ U7*'J 3^^®^B^onn^-^^t46^itjt: (0 7) 0 iOf^n 
n--^tgfi % nbKtz^u--%tfrhnn~rz>t^ yyy^y^^^By\,zm^tzV 
7*'J 3>RNA«D 1X1 0 6 3 t?-^fel9 1 an^-JBUfCifcofco 
[0 0 8 6] 

— 2f* pSGREP-JFHl£«|g!!£ UTT7 RNA*''; ^ 9— IftJ: ^tfft^Lfcin vitr 
o£-jfcRNAtf>3fcf-8fet± x RNAOS*tRNAOg$^t>ff^[i-^ t*?72 X 1 O'^k'-^g- 
RNAT&lK iOin vitro^RNAfcifE t TOtc LT h 9 ^7x^V3>KMv^ 
03n^-S^ti> 5xl0 7 3t'-StDl3D^o^ £fl)iH»**6 
N l/7*'J 3V«»JWfl&^^«lffiUfc«IJlta3|6RNAfcin vitro-£j£RNA&l^ Da e-ffcO 1^7* 
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[0 0 8 7] 

[»J4] 

(F) 3 ^RNAO^ffi 

±K(E) Kt£o TrSGREP-JFHl<7)Huh7»^0 h7>^7x^y 3 >) Lfc V 

^ U a ^ttUMBIS * n - > <b W# L h - * ;VRNA£ $r &Huh7» Kffh9^7x 
* v 3 > Lxm3LLizfflJ&9u->[? u-y l - l 1 IStf-e^o?'-;^ n-> (a 

/HftffifefcJ: *> h-*;VRNA£JSffiL£o *v>T?£<B h - * ;i/RNA£ 7 -ifyrfu y h&K 
£1)ffitirLtz 0 fu~yt LrJipSGREP-JFHl4fHWrn-y*fflv^ 0 jttitLtli, 
h7>A7x^ va ^£#oTV>&^Huh7M/^IW&cfflmLfc h-^RNA (HI 8 f» 
, THuh7j i: L T^1") N Huh7j8fflJ&tf» £ fflffi L h - 9 )\'Wk\Z*mMW%$L Litis"? 
U 3 >RNA£ 10<Z>7*3 e-Jpx.fc^>"/;V (0 8 1 l"10 7 J t Lt^t) , JtWHuh? 
M^ttffi LizY-9 ^mk^WL^nx^WLLtz VfV 3 >RNA&10O 8*3 fcf-inx 
iz*r>7)l< (0 8*, T10 8 J t LT^t) *ffl^;oi8t, 1-1 lliffl^n- 

[0 0 8 8] 

£ Ojgf*^ rSGREP-JFHl MjgO* £ § <^»RNA^pSGREP-JFHl#^fi<J 7"n - r K J: *) $t 
(0 8) o t*U^J:!K *%0fc h7>X7x^y 3 > t^rSGREP-JFHl^fi*•r 
^ w 7 s n >rna^« * d - y n xm.rn.nm lx^&z. t tmm 2tnz 0 z izmik 9 n 

->fflX, SIV7'J 3>RNAOS^H^^,^h^$tL7to 881 ^n- 
^2, 6, 9> lOJi^SSvyy 3 >RNAOfij& ? #- < , ^n->4. 8, lll±*«V-/y 3 ^RNA 

[0 0 8 9] 

(G) *^"f ^SH&tfe^OJV ADNA^©&*&*©#4BOilR 
^Jfiifl3©#MfclTOfc LTffcRLfc. V7 p 'j3yRNA^h7>X7x^-> 3 >U 

#ibW;«^a->(:o^ n^^^G418^*t-r^#1475 ? ^^^^ v># 

(«fe>^7 p 9^f v-. NE0-S3 I 5' -AACAAGATGGATTGCACGCA-3' ( 

mmtt 16), rvf-fc^y^'fv-, neo-r : 5 • -cgtcaagaaggcgatagaag-3 • 

1 17)) ^fflv^-c, mi&?v->frb&mLtzm^mM<ory&im*mmt^z?cm 

(Bfcffofco fflv^;»?n->|i, rSGREP-JFHlfi^OMV-yV 3 >RNA£?f h7>^ 
7 j. ^ v a y LXnbtiizmM? n - ^ l ~ 8 (rSGREP-JFHlS5fcM^ n - y l ~ 8 ) 
x rSGREP-JCHl£3fe©«» >-RNA£?f h7^7x?y 3 > LtH e> tLfcjfflJfe 

^n->l~6 (rSGREP-JCHlfi3MffiJ&^o-> 1 ~6) X&<oiz 0 ^(O^^ HI 9 GCtf 
fi:i3»J> rSGREP-JFHlfi*?SB^^n->tcov>T{i. P^Tt 8 * n - >1 *:frvf3/ 

vwttaeToittii^ $ *tfc m®. 9 n - > ttf&«e> t,^* 9 £«, rSGREP-jcHi 

f|Ol/->3) o £0»tt?n->li, rSGREP-JCHlfiS£om^7 P V a 

w 5/ v#ttas^*qg±aai&o^'y aim* *t^i tui^ G4i8TO^« 

nLtzt^x-htitZo £«»tt^n->|:iv>Tll, flfco^ n - > fi^^ J) „ i/^Ja 
>RNAg#{±»rtT'e*^LTV^V^(7)t#£f>tL7to iO^tli, ^(D(H)IC^ 

[0 0 9 0] 

(H) HCV^>/^^R^m 

rSGREP-JFHl^^rSGREP-JCHl f7^7x^yg ^^Bfla^ n - yfrhH&K. 9 
^K^ffiffiLT. SDS-PAGE^0 f -7^^^^-7*ny hft^J:fj##fLfe (HI 1 0) 0 fp*fc 
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~ 8 , WrSGREP-JCHlft n - y \ ~ 6 Xh & „ * NSSifi^Sr-S-trf&il"/^ 
* ^ KDNA*Huh7«BJ!& t F7>vx>H:h7^7x^y 3 >U#^ tt^^m^ 
£»tt*N5& (NS3* W*?jR) Lfc 0 F?>^7x7">3> L"CV*&v>Huh7$B 

tz* >/*?MWV&*?WFWk (Immobilon- P , Mi 1 1 iporettM) l^Dyfrz/U JftNS 
3#*fi5$tfr (Dr. Moradpour X *) ftfc £ <T> \ Wolk B, et al, J. Virology. 2000 

; 74: 2293-2304) *m^XW& WJ 3 >RNAK3- K$*LTV*<g>NS3* W^Rfcfcffi 
t fco 11 1 0 KtfS *t£ £ *) . rSGREP-JFHlft3feHlfe* n - y 1 ~ 8 StfrSGREP-JCHlfi 

**rSGREP-JCHia*«BI&^n-> 3 (±IB(G)T?»te* n-* t L -OfctB S * n- 
f) -Cfi. NS3^^^ffo|&atttft(B$*Lft*»ofco 1"&*>*>. rSGREP-JCHlS^M^ 
n-^3-ef±> U^'J 3>RNA^>|ft»lt±fllK$*t&3&»ofco !>7>X7i7*>3 , /Uv» 
*V^Huh7»-eNS3^ Jt7&**feffi £ *lfc*»o fcfc** NS3* Jtj5**fcffl $ ftfcUBJfe 

^n-v-ctt, F7^7x^y 3 7^/:k7" , J3 >RNA^S#^1-^> i 2: «fc ij NS3 

[0 0 9 1 ] 

It IS L /Co 
[0 0 9 21 

^©(Gj&tfCH)©*!**^ l^V 3>RNA& h7>^7x^i/a>LTtlU 

M:?n->-ci±> w; a >KNAiJ s WS&ztix\t*& £ ttftfemztitzo 

[0 0 9 31 

(i) m&&m<Dm%r 

3 "C\ rSGREP-JFH10Huh7#fflJJ&^?> h57^7x^i/a> £^T#;i Lfc 

V 7 U 3 >%fifflll& ^ n - > h TO L h - 9 /HiNA* *r &Huh7#BIJ& fc|¥ h 9 > * 7 

*) Y - * ^m^^-F^Etl Lfco £ O b - 9 /UlNAfcttfflfc LT&^5£8J*Superscrip 
t II(Invitrogentt^)fc7'7'l''7-9641R-IH (5' -GCACTCTCTGCAGTCATGCGGCTCACGGAC-3' ( 
@B^iJ#-t 18) ) K <fc Y) V -7 V 3 ^RNAKftJfc-*- & cDNA£ ^ 1 aME¥RJSK «t *'cDN 

[0 0 9 4J 







5x 1st strand Buffer 


4 


2mM dNTP 


5 


0. 1M DTT 


1 


9651R-IH ^7^7- (100 /zM) 


1 


DW 


6.5 


Sample RNA (2 mg/mL) 


1 


RNasin (Promegatt^l) (40U/> L) 


0.5 


Superscript II RT (Invitrogeirfii 


S) 1 




20 ^/l 



[0 0 9 51 

cmA&f&Rmt LXit, ^T-tlS^RNas in t Superscript II&W-tD&MZU&LXm® 

mz RNasin t Superscript II*ma LX42VX 1 BSW&J&S-frfc^ $ 

tblE# 2 0 03-3107783 



#K 2003-329115 



^-*J: 18/ 



[0 0 9 6] 

to 

[0 0 9 7] 







mum 


77-f7- 1 




A/ 


42S-IH 


433R-neo 


41 - 470 


B/ 


C/S17ssp 


4680R-IH 


28 - 3026 


c/ 


4534S-IH 


7279R-IH 


2880 - 5625 


D/ 


7198S-IH 


9367R-1H 


5544 - 7713 


E/ 


9247S-NF 


9576R-NF 


7597 - 7960 



[«2] 









42S-IH 


CCCCTGTGAGGAACTACTGTCTTCACGC 


m?m-f% i 9 


C/S17ssp 


CCGGGAGAGCCATAGTGGTCTGCG 


mmm^r 2 0 


4534S-IB 


CCACTCAAAGAAAAAGTGTGACGAGCTCGC 


m&m^r 2 1 


7198S-IH 


GGCTTGGGCACGGCCTGA 


mnmn- 2 2 


9247S-NF 


GCGGTGAAGACCAAGCTCAAACTCACTCCA 


mnm-^r 2 3 


433R-neo 


AGAACCTGCGTGCAATCCATC 


&mm^ 2 4 


4680R-IH 


CCCGTCATGAGGGCGTCGGTGGC 


mnm^ 2 5 


7279R-IH 


AC CAGC AAC GGTGGGCGGTTGGTAAT C 


IB^J#^- 2 6 


9367R-RI 


GGCACGCGACACGCTGTG 


mnm^ 2 7 


9576R-NF 


AGCTAGCCGTGACTAGGGCTAAGATGGAGC 


BB3«-*t 2 8 



[0 0 9 8] 

£ ©PCRJRJ& iztsfth te}-lT(Dbi$l)"C$>Z>o 







7"7^v-l (lOyuM) 


1.0 


7°5'fv-2 (10// M) 


1.0 


2.5mM dNTPs 


5.0 


lOx LA Buffer 


5.0 


MgCh (25mM) 


5.0 


LA Taq(TAKARA) (5U//K 1) 


0.3 


DW 


30.7 


mM cDNA 


2.0 




50 fi 1 
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[0 0 9 9] 
[0 10 0] 

JHJ t rSGREP-JFHlOgfi^iJ t ©JfctJtfcffo fc„ 3 fcjjr*- 0 

[0101] 
1^3] 





MM 


mm 




NS3 


0 


5 


5 


NS4A 


0 


2 


2 


NS4B 


0 


3 


3 


NS5A 


0 


7 


7 


NS5B 


3 


5 


8 




3 


22 


25 



[0 10 2] 
[0 10 3] 
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*&«#^ 








75 


CI 


7098 


A 


G 








7157 


A 


G 


Y C 


2824 


C2 


4955 


C 


0 


A =» V 


2090 


C3 


4936 


A => 


G 


T =» A 


2084 




5000 


A => 


G 


y =*> c 


2105 




7287 


A => 


G 








7288 


A =4> 


G 


M =J> V 


2868 


C4 


5901 


G 


U 


E =» D 


2405 




6113 


A =s> 


0 


H L 


2476 


C5 


2890 


A 


G 


K => E 


1402 


C6 


7209 


A =* 


G 







[0 10 4] 

$4X^1 i kssv^t, rci~C6j f±> w^m^tL/cvy; ^rna^-t 

& 1/ 7° y =r >^MJ& ^n->Cl~C6 £^To n&*##J lil/^ya >RNA rSGREP 

10 1 0 5] 
[0 10 6] 

[0 1 0 7 J 

[%Jfc0l6] 

( J ) Huh7«m^oa J& & v *tz v 7 y a > M)| * a - y <d®3l 

REP-JFHl£ h5^77x^->3 ltf^r>iti:|iLr^f Lt, =ju^-5£ 
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[0 10 8] 

(1) He P G2,»a (Yh&imm.hm~ft&m%m-mm]&^tb&) 

(2) IMY-N9*IBJfe (Ito h K £ *) $ *tfc 0 HepG2« huh *J-ft**Jff W/lfe 0»-&j»fflfl& 
;Hepatology 2001; 34: 566-572) 

(3) HelajfflJE (V: h?-&l(ttA3teMlfe; Can Cer Res. 1952; 12: 264-265) 

(4) 293MB (fc M&B*ffi*jHHJia; Gen. Virol. 1977; 36: 59-72) 
[0 10 9] 

HepG2*ffljfe, IMY-N9«, HelaMB, 3U4293M£fflv>*: h 7 >X 7 jr. * -> a 
=K -e^-PttSl 2a~dlC^i" 0 HI 2a~dtc^fc* •? > HepG2«. IMY-N9«> Hel 
a^^ &O f 293*BIJfett & IC N rSGREP-JFHl f7>X7x?y 3 iz& if & a n ~ - 

[0 110] 

3>&$mM?V'-> kLtzo i-*t>%, rSGREP-JFHl£JBv>&£fcfc:<fc *3 
, HCVV'7 p U3>'tt»JpfflJfeOf^m^35&LTV^**o7t: (Blight et al., Scienc 

e, (2000) 290, 1972-1974) , Huh7W«DfF*JlBllfe^ S.^fflFfflflaT-OHCVV^'J 3 >^ 

[0111] 

(K) Huh7jWI&Jei^<D«B]feSrfflv^P"/U a ^ttJMfflJfetc^tt* ^'Ja ^RNA^tB 

if>^n-y (Molecular Cloning, A laboratory Manual, 2nd edition, 

J. Sambrook, E.F. Fritsch, T. ManiatisSK Cold Spring Harbor Laboratory Press ( 
1989)) OSEttfcftoTfifofco ffaR(J)fcf£v* N rSGREP-JFHl&HepG2, IMY&& WiHeLa 
*Wfe^© h7>^7x^y 3 yi:J:>) #3: Lfcl'^'Ja ^tt«iWI!&* n - ^ , WrSGREP 
-JFH1^239«^Ohv>X-7^^v'3 ^KJ: i)#;£L;fc I'T'y a yi&Wm&<D7-fr? 
u-y (a n - n - i 7 y -frh, Wl 

ny fft(:j;f)itL/:o -fu-f t LT(4pSGREP-JFHl#^^7 p n-'7^ffiv^c 0 MM 
ttttt, F7>X7x^y 3 ^Srffo-CV^V^HuhTjMI&X^ HepG2 fflflkfrbMW-fcifo 
ffiLfch-^/HlNA (Ell 3<f, Ianel25.r>l7) > Huh7JWJ&* •bJffitfc L7t h - 9 
^l*I"e^JiS&L^pyv a>RNA*10O7^a tf-*>-SV>(± % 10O8Hta e-Jpx.fc & <7> ( 
EU3*K lane2 &0>\ 3) £JBv>fc 0 rSGREP-JFHl £ Wl%£.]f£<Djz § £ <£>RNA 

**pSGREP-JFHlW*»^ , n-'/nj: ijlftmStLfc (il 3) 0 £"tuc«tl) % f^(:>7^ 
^7x^yaH fcrSGREP-JFHIK fiSfef 51^7*^3 ^RNA/&*«* n - > ft"?WB&&J& L- 
Tv^ifc^nftBStL^o i »K ilV7"'J a>RNA<7)S(C^^^ *9 , I 

MY «(4^a6T«4 < a^RNA fcttIM-* £ t 7&*9J fcsHcfc o fc e ttz, ?u 

- > IHIT«» a VRNAOffcHIS*** * £ #58 fctf^fco 7t 0 
[0 112] 

(D w ->>wit*!s^-©$v ADNA^^)ja*-&*o^r^soiiK 

avmttJMBUfe^n- j/fcov*-r\ -?-?>,M^ n- >OG418K*j--T& 

tbiz^ ^*'?<< i'>m®iAitt!&mt)77J'?- (t^T*?^?-, NE0-S3 I 5'-AACAA 
GATGGATTGCACGCA-3 ' (E?!l#^- 2 9). T 4 NE0-R I 5' -CGTCAAGAAGGC 

GATAGAAG-3' (Wffltt3 0)) £/Bv>T, U Lf^±»0^7 ADNA 

**»Si:-r*PCItftMB«:fTofco fflv^tt^n-y(i, HepG2 «K rSGREP-JFHlE&Jfc 
^liV^y aVRNA=S:|fh7>^7jc.^'>3>LT#<c>tt7taJ3&^n->'l % 5. 7. 
8. 9. 10. IK 12, RV1 3, & <b TFKIffl N9 «^ rSGREP-JFHlfi^omi^ 
3VRNAS:fih7>^7x^^3 >LT#f>tt7tm^o->3. 4, 5> 6. 7 

, ffiSE# 2 003-31077 8 3 
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, 8, 9 , 10, %.Tf 1 lt^o fc 0 - O^*, m 1 4 fc^f 2: *3 , HepG2MKrSGRE 
P-JFH1#AL£M* n-v(:o^t(i, 9 ? o - ^^^-f v >»ttitfe?- 

OliiliI^$^7tl^tt^n->{iM*e>^^^o7t 0 IMY N9«lCrSGREP-JFHlSAL^ 

[0 113] 

IWK, Hela MtCrSGREP-JFHlfi*^^l^y; 3 >RNA£W b 9 7 a > 
LT#<b tltzfflM? u - ^^.raSM^rSGREP-JFHlfi^O^vyV 3 >RNA£|? h 7 

[0 114] 

(M) hcv^ y^^Mo^ta 

m±LLt:nM?v->frb1gmK£ 0 *>/^«£tttBL-C, SDS-PAGETlt^^**^ 
/D7bi*t:ii))»lfLfc (Ell 5) 0 II^M* n- >{±, tismxm^tz i> <D t Wl C 
X$>*), HepG2 fflMl' rSGREP-JFHlfi^O^l-y; a >RNA£S b 9 7 ^ ~> a > 
LT#<b*L£#BIJ&* n-> l N 5, 7, 8. 9. 10, 11, 12> 
IMY N9 ^SJE^rSGREP-JFHlfi^O^vy; 3 >RNA£?S h7^77i^ya> LT# <b 
*itzmm.?v-> 3 , 4. 5, 6> 7, 8. 9. 10, Wl lt'Wo tfc, Bfc3t (Le 
hmannet.al., Science, (1999)) Clot, HuH7KrSGREP-JFHl£i»A LTfpmL/cHCV 
RNAl^V xy^Wkfflm?v~->*M&*iM (Hi 5 lane 4-1, C6) U/Co £<bK, b 
7>^7x^ys ^ LTv^v^«^f>tttB ^?@&l&1£*tfl8i: Ltlv^: (H 
1 5 lane N) o ^ti^ti<Dm&? n - >frbMlh Ltz * Sffc&SrPVDFil (Immobil 

on-P, Millipore*±5£) i:7*nyfO^U ^NS3#^fi^# (Dr. Moradpour <fc ^-5- 
Wolk B, et al, J. Virology. 2000; 74: 2293-2304) ?:fflV^llV7 ,, J 
3>RNA^3- K$ttTV^NS3^>^^^K?r^mL^o HI 1 5 ±m~7FZ t £ *) , rS 
GREP-JFHlfi*^^^ V 7° V 3 VRNA^ff b7V^7x^ya> Ltf <b ttfcM^ n - > 
1, 5, 7, 8, 9, 10, 11, 1 2, RXTl 3, & £ UH'IMY N9«CrSGREP-JFHl 
S*©Sf V/'J ^ >RNA^S b 7 >X 7 x ^ -> a > LT# f> n - > 3 , 4, 
5, 6, 7, 8, 9, 10, Rtfl IT-}*, ittfflt |S| £<9* >/^H^m$ 

[0 115] 

5tM£i3£&o£o NS3 ^ WCtrbfr*m#JfiLTt^^x.T^^^f o 

tz 0 HI 5Tgi:^ti7^ rSGREP-JFHlft*OMV'7 Pi ; s^RNA&Sb? 

^7x^-> a > L"C#<b^M^ n-> 1 % 5, 7, 8, 9, 10, 11, 12,25. 
Xf 1 3 , & klTKIMY N9M^rSGREP-JFHlS*0«W7 e V 3 VRNA&S f7>^7x^ 
->3 y\^X'&bKtzmfk?V-> 3, 4, 5, 6, 7, 8, 9, 10s RlfllTtt, H 

[0 116] 

IWK, Helaa^^rSGREP-JFHlS^O^Ml^T 0 '; 3 >RNA=Srff b7^7x7'>a>L 
Xftbtltzmifc? n-^7i.^293mt"rSGREP-JFHlfi*O^MV7 Pl J 3 >RNA£S b 7^ 
^ 7 x ^ a > L t|# ^iH^ n - > i;o v> t , TOo^it^fo £ t i NS3& 

[0 117] 

m±Oti3i9, l/7*V 3>RNA£ f7^7x^ya y ITf £UclI^n->tii 
„ 1/7*'J3 >RNA^$g $ tLT v» 4 £. tmm £ Hfco 

[0 118] 
[JUfe«7] 

(n) m&%zm<omtiT 

ttJlE#2 00 3-3107783 
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m ±L L tz V 7° ' ; n * n - > ^ ib m% L rt: h - * ;VRNA £ if & & * 0« 

F7^7x^y 3 >U, HepG2«^ov^T{±14<7)m^ n-y^r, HeLajSfflJ&Kov> 

^tl-T^Jfiiti L7t 0 £0 h - * ;HWA£»1IK LT&ijg^gt^SupersCript II(Invitroge 
nttm) t 7*7 J ^-9641R-IH (5* -GCACTCTCTGCAGTCATGCGGCTCACGGAC-3 ' 3 1 ) 

[0 1191 







5x 1st strand Buffer 


4 


2mM dNTP 


5 


0. 1M DTT 


1 


9651R-IH 7*7 -Tv- (100^ M) 


1 


DW (a£^7X) 


6.5 


Sample RNA (2 mg/mL) 


1 


RNAsin (Promega*±§SD (40U» L) 


0.5 


Superscript II RT (InvitrogenttiD 


1 




20 fi\ 



10 12 0] 

cDNA^J&RjSi: £-f_tfH?>RNAsin£ Superscript II^KI^I'&ltfl!) 

ifcfc RNAsin t Superscript II&$$n LXA2V.X 1 B#F^»££-£*:^ $ £ KlGXlXlSft 
[0121] 

££>^ i<Oi ? ^LT#f>tt^cDNA^ov^T^ JSITO^JIIK J: *) 5£&cD7*7 << "7--b 
7 h fcffiV^PCRJfilig^foT. 1/7"'; 3 ^RNA<Ol5l?^^lC*?^^DNAiii|i§BlfM-^#/c 

"To 

[0 12 2] 
[^5] 













T7^-f*— 1 






A 


42S-IH 


433R-neo 


41-470 


B 


C/S17ssp 


4680R-IH 


28-3026 


C 


4534S-IH 


7279R-IH 


2280-5625 


D 


7198S-IH 


9367R-IH 


5544-7713 


E 


9247S-NF 


9576R-NF 


7597-7966 



[0 12 3] 
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1^6] 





&*1E5IJ( 5' to 3') 




43S-IH 


CCCCTGTGAGGAACTACTGTCTTCACGC 




1^14 


UQI / SSp 




1E5U1 


l#15 


4534S-IH 


CCACTCAAAGAAAAAGTGTGACGAGCTCGC 


IE5UI 


1^16 


7198S-IH 


GGCTTGGGCACGGCCTGA 




l#17 


9247S-NF 


GCGGTGAAGACC AAG CTCAAACTCACTCCA 


WM\ 


1^18 


433R-neo 


AGAACCTGCGTGCAATCCATC 






4680R-IH 


CCCGTCATGAGGGCGTCG GTGGC 




1-^-20 


7279R-IH 


ACCAGCAACGGTGGGCGGTTGGTAATC 




1^21 


9367R-IH 


GGAACGCGACACGCTGTG 




|#22 


9576R-NF 


AGCTAGCCGTGACTAGGGCTAAGATGGAGC 





[0 12 4] 







7*7^7-1 (10//M) 


1.0 


(10//M) 


1.0 


2.5mM dNTPs 


5.0 


lOx LA Buffer 


5.0 


MgC12 (25mM) 


5.0 


LA Taq(TAKARA) (5U/// 1) 


0.3 


DW (M^tK) 


30.7 


mm. CDNA 


2.0 




50 fi\ 



[0 12 5] 

PCR^JS^^ft i {i> mTofcj3i)-t?$>o^ : 95rt?2^-K ; 98r-eio#^^^g 
68*c-e8^^35^-< 72^-e7^F^;4X:T?^o 

[0 12 6] 

m±c9 iM:Lt#f. *i.*:#PCRj»<7)i^@e?iJ & U ^ <^DNA|£?iJ*fJSi- ^>RNA@5 



, tBlE#2 0 0 3-3 1 0 7 7 8 3 
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HepG2 i&BIIfefzfctt&JFH-l U^ > U3>f=feJtS3i-&^05««f 



>7P— > 


t 


J lax ca -/ 


i 


EH 


nt?pin i 




C( I T" 






HeplH3 


6887 


2734 


C=»A 


T=>N 




DOO/ 








HeplH8 


6580 


2632 


U=>A 


S=*T 




7159 


2825 


U=*C 


Y=*H 


HepiH9 


3342 




A=>G 






3594 




C=>A 


£L 




7230 


2848 


U=>A 


N=*K 


HepIHIO 


-5052 




U=*C 


fcL 




6943 


2753 


C=>A 


P=>T 


HeplHl2 










HeplH13 


4302 




C=*U 






5687 


2334 


G=>A 


G=>D 




6110 


2475 


A=>G 


Y=>C 



[0 12 7] 

[0 12 8] 
[«8] 









s 


mm 


HeLaHl 










HeLalH2 


5550 


2272 


U=>C 


S=>P 




6252 




A=>G 


fcL 




7182 




U=>C 


£L 




7217 


2844 


A=>G 


H=>R 


HeLalH5 


3643 


1653 


A=*G 


M=>V 




5851 


2389 


G=»A 


A=>T 




5914 


2410 


G=>A 


E=>K 



[0 12 9] 

$7^8i:^^t, THepIH No. J tiU HepG2 «£/Bv>T? u - WfcLfc U^'J 3 

aiiE#2 0 0 3-3 1 0.7 7 8 3 
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SGREP-JFHlO^*@e^iJ (imttl) f^t^MWi LT/TLTV>4 0 TT5 
/m#-^J tt, JFH-l*n->K3- K$tL^>T5 ymse^lj (@B?iJ#^4) tfOMlStS 

l^oti2«$^lTi/^ 0 3t7 ti3*K mx.it, tu-V HepIH17?H. V:/ 

V a ^RNA±OE2?9#^ <Di&&#-^ 6826 KffiS-tittafejW^feA^JEII U foil, 

m\m^<r> 757 »#-*27i4 i:«in7?y maw e>E^ag#i Lfzz\tzmLx\,*z> 0 

mKmSiz^x, iHeLalHNo.J f±. HeLaM£fflv>T, ^ n - > it Liz l/^'J a > 
[0 13 0] 

= i'RNAK attars ^Bft3E*«-5l§iBi-r«[*IEIIl±. IBIJfeK js^r 
10 13 1] 

x?v-->yr2>fztt><Dm l m&tLxmmx$>z> 0 

[0 1 3 2 j 

imi\ mat, ^w^^hcv-rna^-/'; x>*ftm^&tz&<r>mwmk<7>mm s £ 
m*7Fiymmx#>z> 0 mi<D±m±, pjraiWpjcHi^-^ ;^r/AflfAi^© 
#3t^7Ki- 0 micoT^(±. pjFHi^tfpjcHKD^-r ^rnxmrno— 

5 KDNA pSGREP-JFHl^^pSGREP-JCHlO ^ *V A# A«ttO*£*3Fi- e ®* 
OBE-^li&T<afc*5»)-C*4o T7: T7 RNA:/n*-*- N G: #ALfcJFH-lXttJ 
CH-lfi*DNA(^ 5'm<Q±ffifr~Dr7 RNAyn^e- ? -@2?iJo 3 'MOTmizmx LfcdGT 
P, 5'NTR: S'^NiiRfgj^ Core: ^7^ W^f, 3'NTR: 3'^iiiR ! RM 
tt*. El, E2: ^>-?n-^^>-^^K 0 NS2, NS3, NS4A, NS4B, 
NS5A, NS5B: **** WC*ff 0 Age I, Cla I, Xba I: ftt||R&%Age I, CI 
a I&OTba I<7>^JBffSi5^ 0 GDD: NS5B* >s*?ft<Dl&&qifriziR£-j-&7$ 
^-^GDDOftgo neo: v ^teflM*^ EMCV IRES: EMCV 

ires m>i-m$t*7 4 )\sx<dw) #v-Am&%M30 0 

[HI2A] H2Ali, rSGREP-JFH10ig3fe@e^JS:^-t 0 
[HI 2 B ] IH 2 B f±. rSGREP-JFHlO^gB^J fcjjrf 0 
[m 2 C ] HI 2 C ii, rSGREP-JFHl<7>^@2?!j£7Fl- 0 
[HI 2D] Hl2Di;J\ rSGREP-JFHlcom^ie^J^^-ro 
[HI 2 E] HI 2 E fi, rSGREP-JFH16D^6e^iJ ^^1" 0 
[HI 2 F] @2F!t rSGREP-JFHl^^@e^!j^^f o 
[HI 3 A] Hl3Af±, rSGREP-JCHl©ttafE2?ll*^i-o 
[HI 3 B ] HI 3 B 0\ rSGREP-JCHlom^IB^!) ^^i" 0 
[HI 3 C ] HI 3 C {±s rSGREP-JCHlO^iB^J^^-to 
[ m 3 D ] ® 3 D t± > rSGREP-JCHl Oia^@a?!| * ^i" 0 
lm 3 E ] 03Eli> rSGREP-JCH10^[3giBJiJ4r^i- 0 
[HI3F] H3F&. rSGREP-JCHl<^ta^@e^J^^-r o 

[H4] HUf±. rSGREP-JFHl, rSGREP-JFHl/GND^l^rSGREP-JFHl/dGDD^^tt-p^ h 
7>X7i^i/3^L^Huh7«BJiaO3O^-^^t7t^-e** 0 f7>77x 

• &§E#2 0 0 3 -3 1 0 7 7 8 3 
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^->a > L/wRNAOfiti. _tgtf>3ottv*1*;ri&l00 ng, TlS:tf> 3 o}±v>-f ;ftk300 n 
gT'&& 0 

[0 5] HI 5 rSGREP-JFHl^.y f rSGREP-JCHl^-ett'rah9>'^-7^^V3>L/i 
Huh7i«cD, ^%^OG418OziJ^^0. 5mg/ml-C& &*§^<?> nn^ —j&$.$:7FL?z^M 

[16] H6i±. h 9 >X7^? ->a ^^EB^On n^-^^f^c^J-i-^,, Mung Bean 
Nuc 1 ease^3®<7>f^# £ ^ L £ ¥*"?r * -So >7^^-7^^'>3>Lfc rSGREP-JFHl RN 
k<DMit, V»-f *L&100 ngT?&& 0 v^-f *L^^%4JOG418<^jftK{il.0mg/mlT*^> 

[H17] E]7t±, rSGREP-JFHlfc f7^X7x^ V3 >- LT«3: Lfc V/V 3 
M * d - > h-* ;vM14RNA£$f £ &Huh7« ffiF7^7x^y 3 > 

Jfe? n - y 6 h - * ;v,«14RNA£ffi^T 9 6 3 Di-©||^ ttbb tttz 

fcMW-M: h-*;i^J&t£RNA£/Bv*T7 7 3D--© 

^^ti6^tL7t^*^1-„ W*i& V^V 3>RNA£ 1 X 1 0 7 3f-^tfi^ 

[El 8] HI8t±. rSGREP-JFHl£ ^7^^7x^y 3 > t T #21 L 7t V y; n >^ 
jMJfe* n - > fi^O h - * ;HflJJ£ttRNA£$r 3rHuh7*MfeU?f F7^7x^y 3 > 
ITMH^n- VffiSfcO h~^;VRNA^LT. rSGREP-JFHl#HWyn-^=SrflS 

^TOiisiJ^^o 1 0 8 : HuhTttUfe;^ kffcfcBLfch-* JHTOAt:: WWT 1*3 T^Jft L 
/cl^'J 3>RNA£ 1008*3 tf -in yf)V, 10 7 : Huh7iSfflJ&^<bttaiL*: h 
- ? /HaiAfciflBMf ft X*^$L L^WJa >RNA£ 10O 7 * a tf - iJP £ fc^ > 7>K Hu 
h7: h7^^7xi- >LTv>&v>Huh7«^»ffiL£ h-*;l/RNA. r-ZW* 
u->: ^-;^n-^t,i(ljU h-^;VRNA, 1-1 1: »o->i~i 
1 *L-e*L*» 5> WW L h - * ;WRNA 0 r V 3 >RNAJ t±> rSGREP-JFHKD^ 
ft^X^tv-*-, T28SJ {i4.5kbCO^T-S- 1 f'l'X^^i- 1 J *'V-ARNAv- 
ri8SJ {il.9kbO^T-S-9-^X^^i-'; ^V-ARNA^-^-O^IbfeM^^ 

[HI 9 ] EI 9 l±> rSGREP-JFHlXttrSGREP-JCHlft*«»V^U 3 >RNA^ 5 H h7^7, 

&t)X°$>Z> 0 M : DNA^iv -77-, 1 ~ 8 : rSGREP-JFHl &3fc$HJJ& 9 n - > 1 ~ 8 
, N: h5^7x^yg > LTV^v>Huh7jBBfl&> P: WttJtJS (**v>f 
a^OPCBmdm*) o -y7> £1lfl¥Kov->OifciB{:mTOfc33i9T-&&o M: 
DNA^-T-*^-* 1 - 6 : rSGREP-JCHlfi5Nffifl&* n -> 1 ~ 6 0 
[HI 1 0 ] @10U, rSGREP-JFHlXfirSGREP-JCHlft^SSi V? U 3 >RNA;W h 9 

*Sr^-r¥K"C**o £«^>V-> 1 ~8 : rSGREP-JFHlfi$&fflJ&*n-> 1~ 
8 o 1 ~ 6 : rSGREP-JCHlftJfcWia^ n - > l ~ 6 0 £ffl¥*0 V 

->OP: NS3*>v?^« (»te*tJB) , N: h 9 7 ^ * v 3 > LTV*&v>Huh7 

[Bail] HlUi, rSGREP-JFHlS3feO«»V/'; 3 >RNAOHuh7^fl&^OS b 9 > 
*7^?>/3 y^tflL/; 2 1 O^i^n-V^^WfU/cI/T"'; 3>RNA* O 

[Hi 2] 112l±> 1) HepG2ifflJ&, 2) IMY-N9^BJ^, 3) 293^^. Xt±4) HelalHJ& 
£ffiv*/irSGREP-JFHl hr/X7x^y 3 >O^^7j^i- 0 
[113] i 1 3 li, V7"'J v>i®MMM?v-><o; — ^ >?uyr-4 yyoBZk 

[114] ESHIi, 'syW&fcittors ADNA^oa.*&**HB1-S 

tii iiE#2 0 0 3-3 1 0 7 7 8 3 
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[Ull 5] 01 5l±, 1/ 3 >W1 ^ D - y ^ y I f ^ y 7 

[0 13 3] 

se^J*-^- 8~i2 - Aiie?!]<7>ra : £-j&rna 
i3~4i - AXBB^ilora : ^JftDNA 



fcB8E#2 003-310 7 783 
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SEQUENCE LISTING 

<110> Toray Industries Inc. 

Tokyo Metropolitan Organization for Medical Research 
Johannes Gutenberg-Universitaet Mainz 

<120> Establishment of the genotype 2a Hepatitis C virus subgenomic replicon 

<130> P03-0852 

<140> 
<141> 

<150> JP 2003-148242 
<151> 2003-05-26 

<160> 41 

<170> Patentln Ver. 2. 1 

<210> 1 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<220> 

<223> Inventor: Wakita, Takaji 
Inventor: Kato, Takanobu 
Inventor: Date, Tomoko 

<400> 1 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 

ffitt#2 0 03-310 7783 
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cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaagag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900 
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uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
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gugaaagccc uagcggccug caaggcugcg 
ggcgaugacc uaguagucau cucagaaagc 
agagccuuca cggaggccau gaccagguac 
gaauaugacc uggagcuaau aacauccugu 
cggggccgcc gcagauacua ccugaccaga 
ugggaaacag uuagacacuc cccuaucaau 
ccaaccauau ggguucgcau gguccuaaug 
gacacccugg accagaaccu caacuuugag 
uuggaccuuc cagccauaau ugagagguua 
uacucucacc acgaacugac gcggguggcu 
cucagggugu ggaagagucg ggcucgcgca 
aaagcggccg uuugcggccg auaucucuuc 
acuccauugc cggaggcgcg ccuacuggac 
gggggcgaca uuuuucacag cgugucgcgc 
cuccuacuuu ucguaggggu aggccucuuc 
uagguacacu ccauagcuaa cuguuccuuu 

UUUUUUUUUU CUUUUUUUUU UUUUUCCGUC 

uuucuuggug gcuccaucuu agcccuaguc 
augacugcag agagugccgu aacuggucuc 



gggauaguug cgcccacaau gcugguaugc 6960 
caggggacug aggaggacga gcggaaccug 7020 
ucugccccuc cuggugaucc ccccagaccg 7080 
uccucaaaug ugucuguggc guugggcccg 7140 
gacccaacca cuccacucgc ccgggcugcc 7200 
ucauggcugg gaaacaucau ccaguaugcu 7260 
acacacuucu ucuccauucu caugguccaa 7320 
auguauggau caguauacuc cgugaauccu 7380 
cacgggcuug acgccuuuuc uaugcacaca 7440 
ucagcccuca gaaaacuugg ggcgccaccc 7500 
gucagggcgu cccucaucuc ccguggaggg 7560 
aauugggcgg ugaagaccaa gcucaaacuc 7620 
uuauccaguu gguucaccgu cggcgccggc 7680 
gcccgacccc gcucauuacu cuucggccua 7740 
cuacuccccg cucgguagag cggcacacac 7800 
uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860 
uuucuucccu ucucaucuua uucuacuuuc 7920 
acggcuagcu gugaaagguc cgugagccgc 7980 
ucugcagauc augu 8024 



<210> 2 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 2 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
ugcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa aucccaaacc 360 
ucaaagaaaa accaaaagaa acacuaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
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ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgauaau accauggccc ccaucaccgc uuacgcccag 1800 
cagacacgag gucucuuggg cucuauagug gugagcauga cggggcguga caagacagaa 1860 
caggccgggg agguccaagu ccuguccaca gucacucagu ccuuccucgg aacauccauu 1920 
ucgggggucu uauggacugu uuaccacgga gcuggcaaca agacacuagc cggcucgcgg 1980 
ggcccgguca cgcagaugua cucgagcgcc gagggggacu uggucgggug gcccagcccu 2040 
ccugggacca aaucuuugga gccguguacg uguggagcgg ucgaccugua uuuggucacg 2100 
eggaacgcug augucauccc ggcucgaaga cgcggggaca agcggggagc gcugcucucc 2160 
ccgagacccc uuucgaccuu gaaggggucc ucggggggac cugugcuuug cccuaggggc 2220 
cacgcugucg gaaucuuccg ggcagcugug ugcucucggg guguggcuaa guccauagau 2280 
uucauccccg uugagacgcu cgacaucguc acgcggucuc ccaccuuuag ugacaacagc 2340 
acaccaccag cugugcccca gaccuaucag gugggguacu ugcacgcccc cacuggcagu 2400 
ggaaaaagca ccaagguccc cgucgcguac gccgcccagg gguauaaagu gcuggugcuc 2460 
aaucccucgg uggcugccac ccugggauuu ggggcguacu uguccaaggc acauggcauc 2520 
aaccccaaca uuaggacugg agucagaacu gugacgaccg gggagcccau uacauacucc 2580 
acguauggua aauuccucgc cgaugggggc ugcgcaggcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacucugu ggaugcuacc acuauucucg gcaucgggac aguccuugac 2700 
caagcagaga cagccggggu caggcuaacu guacuggcca cggccacgcc ccccgggucg 2760 
gugacaaccc cccaucccaa uauagaggag guagcccucg gacaggaggg ugagaucccc 2820 
uucuauggga gggcguuucc ccugucuuac aucaagggag ggaggcacuu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgca acggcccuuc ggggcauggg cuugaacgcu 2940 
guggcauauu acagaggguu ggacgucucc auaauaccaa cucaaggaga uguggugguc 3000 
guugccaccg acgcccucau gacgggguau acuggagacu uugacuccgu gaucgacugc 3060 
aacgaagcgg ucacccaggc cguagacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cgcaagacgc ugucucacgu agucagcgcc gagggcgcac ggguagagga 3180 
agacugggca uuuauaggua uguuuccacu ggugagcgag ccucaggaau guuugacagu 3240 
guaguacucu gugagugcua cgacgcagga gcugcuuggu augagcucuc accaguggag 3300 
acgaccguca ggcucagggc guauuucaac acgccuggcu ugccugugug ccaggaccac 3360 
cuugaguuuu gggaggcagu uuucaccggc cucacacaca uagacgcuca uuuccuuucc 3420 
cagacaaagc agucggggga aaauuucgca uacuuaguag ccuaucaggc cacagugugc 3480 
gccagggcca aagcgccccc cccguccugg gacgucaugu ggaagugcuu gacucgacuc 3540 
aagcccacgc uugugggccc uacaccucuc cuguaccguu ugggcucugu uaccaacgag 3600 
gucacccuua cacaccccgu gacaaaauac aucgccacau gcaugcaagc ugaccucgag 3660 
gucaugacca gcacgugggu ccuggcuggg ggagucuuag cagccgucgc cgcguauugc 3720 
uuagcgaccg gguguguuuc caucauuggc cguuuacaca ucaaccagcg agcugucguc 3780 
gcuccggaca aggagguccu cuaugaggcu uuugaugaga uggaggaaug ugccuccaga 3840 
gcggcucucc uugaagaggg gcagcggaua gccgagaugc ugaaguccaa gauccaaggc 3900 
uuauugcagc aagccucuaa acaggcccag gacauacaac ccgcugugca agcuucgugg 3960 
cccaagaugg agcaauucug ggccaaacau auguggaacu ucauaagcgg cauucaguac 4020 
cucgcaggac ugucaacacu gccagggaac ccugcugugg cuuccaugau ggcauucagc 4080 
gccgcccuca ccaguccguu gucaacuagc accaccaucc uucuuaacau ucuggggggc 4140 
uggcuggcgu cccaaauugc gccacccgcg ggggccacug gcuuuguugu caguggccug 4200 
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gugggagcug cuguuggcag cauaggcuug gguaaagugc ugguggacau ccuggcaggg 4260 
uauggugcgg gcauuucggg ggcccucguc gcguuuaaga ucaugucugg cgagaagccc 4320 
uccauggagg augucaucaa cuugcugccu gggauucugu cuccaggugc ucugguggug 4380 
ggagucaucu gcgcggccau ucugcgccgc caugugggac cgggggaagg cgcgguccaa 4440 
uggaugaaca ggcuuaucgc cuucgcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcguguc acccaacugc uuggcucucu cacuauaacu 4560 
agucuacuca ggagacuuca caacuggauc acugaggauu gccccauccc augcgccggc 4620 
ucguggcucc gcgaugugug ggacuggguc uguaccaucc uaacagacuu uaagaacugg 4680 
cugaccucca agcuguuccc aaagaugccu ggccuccccu uuaucucuug ccaaaagggg 4740 
uacaagggcg ugugggccgg cacuggcauc augaccacac gaugccccug cggcgccaac 4800. 
aucucuggca acguccgcuu gggcucuaug agaaucacag gacccaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugu uauacagaag gccagugcuu gccgaaaccc 4920 
gcguuaaacu ucaagaccgc caucuggaga guggcggccu cagaguacgc ggaagugacg 4980 
cagcacggau cauaugccua uauaacaggg cugaccacug acaacuuaaa agucccuugc 5040 
caacuccccu cuccagaguu uuucucuugg guggacggag uacaaaucca uagguccgcc 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguuca gcguugggcu caauucauuu 5160 
gucgucgggu cucagcuucc cugugacccu gagcccgaca cugagguagu gauguccaug 5220 
cuaacagacc caucccauau cacggcggag gcugcagcgc ggcguuuagc gcggggguca 5280 
cccccaucug aggcaagcuc cucagcgagc cagcugucgg cgccaucgcu gcgagccacc 5340 
ugcaccaccc acgguaggac cuaugaugug gacauggugg augccaaccu guucaugggg 5400 
ggcggcguga uucggauaga gucugagucc aaaguggucg uucuggacuc ccucgacuca 5460 
augaccgagg aagagggcga ccuugagccu ucaguaccau cggaguauau gcuccccagg 5520 
aagagguucc caccggccuu accggcuugg gcgcggccug auuacaaccc accgcuugug 5580 
gaaucgugga agaggccaga uuaccaacca cccacuguug cgggcugugc ucuccccccc 5640 
cccaaaaaga ccccgacgcc uccuccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaggag augcccucca acagcuggcc aucaaguccu uuggccagcc ccccccaagc 5760 
ggcgauucag gccuuuccac gggggcggac gccgccgacu ccggcgaucg gacacccccu 5820 
gacgaguugg cucuuucgga gacagguucu accuccucca ugcccccccu cgagggggag 5880 
ccuggggacc cagaccugga gccugagcag guagagcuuc aaccuccucc ccaggggggg 5940 
gaggcagcuc ccggcucgga cucggggucc uggucuacuu gcuccgagga ggaugacucc 6000 
gucgugugcu gcuccauguc auauuccugg accggggcuc uaauaacucc uuguagcccc 6060 
gaagaggaaa aguugccaau uaacuccuug agcaacucgc uguugcgaua ccauaacaag 6120 
guauacugua cuacaucaaa gagugccuca cuaagggcua aaaagguaac uuuugauagg 6180 
augcaagugc ucgacgccua uuaugauuca gucuuaaagg acaucaagcu agcggccucc 6240 
aaggucagcg caaggcuccu caccuuagag gaggcgugcc aauugacccc accccacucu 6300 
gcaagaucca aguauggguu uggggcuaag gagguccgca gcuuguccgg gagggccguc 6360 
aaccacauca aguccgugug gaaggaccuc uuggaagacu cacaaacacc aauuccuaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaaaaaacca 6480 
gcucgccuua ucguuuaccc ugaccucggc gucagggucu gcgagaagau ggcccuuuau 6540 
gaugucacac aaaagcuucc ucaggcggug augggggcuu cuuauggcuu ccaguacucc 6600 
cccgcucagc ggguggaguu ucucuugaag gcaugggcgg aaaagagaga cccuaugggu 6660 
uuuucguaug auacccgaug cuuugacuca accgucacug agagagacau caggacugag 6720 
gaguccauau accaggccug cuccuuaccc gaggaggccc gaacugccau acacucgcug 6780 
acugagagac ucuauguggg agggcccaug uucaacagca agggccaguc cugcggguac 6840 
aggcguugcc gcgccagcgg ggugcuuacc acuaguaugg ggaacaccau cacaugcuau 6900 
guaaaagccc uagcggcuug caaggcugcg gggauaauug cgcccacgau gcugguaugc 6960 
ggcgacgacu uggucgucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020 
agagccuuca cggaggcuau gaccagguau ucugccccuc cuggugaccc ccccagaccg 7080 
gaauaugacc uggagcuaau aacaucuugu uccucaaacg ugucuguggc acuuggccca 7140 
cagggccgcc gcagauacua ccugaccaga gaccccacca cuucaauugc ccgggcugcc 7200 
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ugggaaacag uuagacacuc cccugucaau ucauggcugg gaaacaucau ccaguacgcu 7260 
ccaaccauau ggguucgcau gguccugaug acacacuucu ucuccauucu cauggcccag 7320 
gacacccuag accagaaccu uaacuuugaa auguacggau cgguguacuc cgugaguccu 7380 
cuggaccucc cagccauaau ugaaagguua cacgggcuug acgccuucuc ucugcacaca 7440 
uacacucccc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagagcgu ggaagagucg ggcgcgugca guuagggcgu cccucaucuc ccgugggggg 7560 
agggcggccg uuugcggucg guaccucuuc aacugggcgg ugaagaccaa gcucaaacuc 7620 
acuccuuugc cggaggcacg ccuccuggau uuguccaguu gguuuaccgu cggcgccggc 7680 
gggggcgaca uuuaucacag cgugucgcgu gcccgacccc gccuauuacu ccuuagccua 7740 
cuccuacuuu cuguaggggu aggccucuuc cuacuccccg cucgauagag cggcacacau 7800 
uagcuacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920 
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024 



<210> 3 
<211> 9678 
<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341).. (9442) 
<400> 3 

acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg age aca aat cct 355 

Met Ser Thr Asn Pro 
1 5 

aaa cct caa aga aaa acc aaa aga aac acc aac cgt cgc cca gaa gac 403 
Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Glu Asp 
10 15 20 

gtt aag ttc ccg ggc ggc ggc cag ate gtt ggc gga gta tac ttg ttg 451 
Val Lys Phe Pro Gly Gly Gly Gin He Val Gly Gly Val Tyr Leu Leu 
25 30 35 

ccg cgc agg ggc ccc agg ttg ggt gtg cgc acg aca agg aaa act teg 499 
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr Thr Arg Lys Thr Ser 
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40 45 50 

gag egg tec cag cca cgt ggg aga cgc cag ccc ate ccc aaa gat egg 547 
Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro He Pro Lys Asp Arg 
55 60 65 

cgc tec act ggc aag gee tgg gga aaa cca ggt cgc ccc tgg ccc eta 595 
Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly Arg Pro Trp Pro Leu 
70 75 80 85 

tat ggg aat gag gga etc ggc tgg gca gga tgg etc ctg tec ccc cga 643 
Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 
90 95 100 

ggc tct cgc ccc tec tgg ggc ccc act gac ccc egg cat agg teg cgc 691 
Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg His Arg Ser Arg 
105 110 115 

aac gtg ggt aaa gtc ate gac ace eta acg tgt ggc ttt gee gac etc 739 
Asn Val Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
120 125 130 

atg ggg tac ate ccc gtc gta ggc gee ccg ctt agt ggc gee gee aga 787 
Met Gly Tyr He Pro Val Val Gly Ala Pro Leu Ser Gly Ala Ala Arg 
135 140 145 

get gtc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat tat gca 835 
Ala Val Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
150 155 160 165 

aca ggg aac eta ccc ggt ttc ccc ttt tct ate ttc ttg ctg gee ctg 883 
Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser He Phe Leu Leu Ala Leu 
170 175 180 

ttg tec tgc ate ace gtt ccg gtc tct get gee cag gtg aag aat ace 931 
Leu Ser Cys He Thr Val Pro Val Ser Ala Ala Gin Val Lys Asn Thr 
185 190 195 

agt age age tac atg gtg ace aat gac tgc tec aat gac age ate act 979 
Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser Asn Asp Ser He Thr 
200 205 210 

tgg cag etc gag get gcg gtt etc cac gtc ccc ggg tgc gtc ccg tgc 1027 
Trp Gin Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys 
215 220 225 

gag aga gtg ggg aat acg tea egg tgt tgg gtg cca gtc teg cca aac 1075 
Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val Pro Val Ser Pro Asn 
230 235 .240 245 

mSE# 2003-3107783 
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atg get gtg egg cag ccc ggt gee etc acg cag ggt ctg egg acg cac 1123 
Met Ala Val Arg Gin Pro Gly Ala Leu Thr Gin Gly Leu Arg Thr His 
250 255 260 

ate gat atg gtt gtg atg tec gee ace ttc tgc tct get etc tac gtg 1171 
He Asp Met Val Val Met Ser Ala Thr Phe Cys Ser Ala Leu Tyr Val 
265 270 275 

ggg gac etc tgt ggc ggg gtg atg etc gcg gec cag gtg ttc ate gtc 1219 
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala Gin Val Phe He Val 
280 285 290 

teg ccg cag tac cac tgg ttt gtg caa gaa tgc aat tgc tec ate tac 1267 
Ser Pro Gin Tyr His Trp Phe Val Gin Glu Cys Asn Cys Ser He Tyr 
295 300 305 

cct ggc ace ate act gga cac cgc atg gca tgg gac atg atg atg aac 1315 
Pro Gly Thr He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
310 315 320 325 

tgg teg ccc acg gee ace atg ate ctg gcg tac gtg atg cgc gtc ccc 1363 
Trp Ser Pro Thr Ala Thr Met He Leu Ala Tyr Val Met Arg Val Pro 
330 335 340 

gag gtc ate ata gac ate gtt age ggg get cac tgg ggc gtc atg ttc 1411 
Glu Val He lie Asp He Val Ser Gly Ala His Trp Gly Val Met Phe 
345 350 355 

ggc ttg gec tac ttc tct atg cag gga gcg tgg gcg aag gtc att gtc 1459 
Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp Ala Lys Val He Val 
360 365 370 

ate ctt ctg ctg gee get ggg gtg gac gcg ggc ace ace ace gtt gga 1507 
He Leu Leu Leu Ala Ala Gly Val Asp Ala Gly Thr Thr Thr Val Gly 
375 380 385 

ggc get gtt gca cgt tec acc aac gtg att gee ggc gtg ttc age cat 1555 
Gly Ala Val Ala Arg Ser Thr Asn Val He Ala Gly Val Phe Ser His 
390 395 400 405 

ggc cct cag cag aac att cag etc att aac acc aac ggc agt tgg cac 1603 
Gly Pro Gin Gin Asn He Gin Leu He Asn Thr Asn Gly Ser Trp His 
410 415 420 

ate aac cgt act gee ttg aat tgc aat gac tec ttg aac acc ggc ttt 1651 
He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr Gly Phe 
425 430 435 

etc gcg gec ttg ttc tac acc aac cgc ttt aac teg tea ggg tgt cca 1699 
Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn Ser Ser Gly Cys Pro 
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440 



445 



450 



ggg cgc ctg tec gec tgc cgc aac ate gag get ttc egg ata ggg tgg 
Gly Arg Leu Ser Ala Cys Arg Asn He Glu Ala Phe Arg He Gly Trp 
455 460 465 



1747 



ggc ace eta cag tac gag gat aat gtc ace aat cca gag gat atg agg 
Gly Thr Leu Gin Tyr Glu Asp Asn Val Thr Asn Pro Glu Asp Met Arg 
470 475 480 485 



1795 



ccg tac tgc tgg cac tac ccc cca aag ccg tgt ggc gta gtc ccc gcg 
Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly Val Val Pro Ala 
490 495 500 



1843 



agg tct gtg tgt ggc cca gtg tac tgt ttc acc ccc age ccg gta gta 
Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
505 510 515 



1891 



gtg ggc acg acc gac aga cgt gga gtg ccc acc tac aca tgg gga gag 
Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr Tyr Thr Trp Gly Glu 
520 525 530 



1939 



aat gag aca gat gtc ttc eta ctg aac age acc cga ccg ccg cag ggc 
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Gin Gly 
535 540 545 



1987 



tea tgg ttc ggc tgc acg tgg atg aac tec act ggt ttc acc aag act 
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr 
550 555 560 565 



2035 



tgt ggc gcg cca cct tgc cgc acc aga get gac ttc aac gee age acg 
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Ala Ser Thr 
570 575 580 



2083 



gac ttg ttg tgc cct acg gat tgt ttt agg aag cat cct gat gec act 
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr 
585 590 595 



2131 



tat att aag tgt ggt tct ggg ccc tgg etc aca cca aag tgc ctg gtc 
Tyr He Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val 
600 605 610 



2179 



cac tac cct tac aga etc tgg cat tac ccc tgc aca gtc aat ttt acc 
His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr 
615 620 625 



2227 



ate ttc aag ata aga atg tat gta ggg ggg gtt gag cac agg etc acg 
He Phe Lys He Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr 
630 635 640 645 



2275 
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gcc gca tgc aac ttc act cgt ggg gat cgc tgc gac ttg gag gac agg 2323 
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asp Leu Glu Asp Arg 
650 655 660 

gac agg agt cag ctg tct cct ctg ttg cac tct acc acg gaa tgg gcc 2371 
Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser Thr Thr Glu Trp Ala 
665 670 675 

ate ctg ccc tgc acc tac tea gac tta ccc get ttg tea act ggt ctt 2419 
He Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu 
680 685 690 

etc cac ctt cac cag aac ate gtg gac gta caa tac atg tat ggc etc 2467 
Leu His Leu His Gin Asn He Val Asp Val Gin Tyr Met Tyr Gly Leu 
695 700 705 

tea cct get ate aca aaa tac gtc gtt cga tgg gag tgg gtg gta etc 2515 
Ser Pro Ala He Thr Lys Tyr Val Val Arg Trp Glu Trp Val Val Leu 
710 715 720 725 

tta ttc ctg etc tta gcg gac gcc aga gtc tgc gcc tgc ttg tgg atg 2563 
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met 
730 735 , 740 

a. 

etc ate ttg ttg ggc cag gcc gaa gca gca ttg gag aag ttg gtc gtc 2611 
Leu He Leu Leu Gly Gin Ala Glu Ala Ala Leu Glu Lys Leu Val Val 
745 750 755 

ttg cac get gcg agt gcg get aac tgc cat ggc etc eta tat ttt gcc 2659 
Leu His Ala Ala Ser Ala Ala Asn Cys His Gly Leu Leu Tyr Phe Ala 
760 765 770 

ate ttc ttc gtg gca get tgg cac ate agg ggt egg gtg gtc ccc ttg 2707 
He Phe Phe Val Ala Ala Trp His He Arg Gly Arg Val Val Pro Leu 
775 780 785 

acc acc tat tgc etc act ggc eta tgg ccc ttc tgc eta ctg etc atg 2755 
Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Met 
790 795 800 805 

gca ctg ccc egg cag get tat gcc tat gac gca cct gtg cac gga cag 2803 
Ala Leu Pro Arg Gin Ala Tyr Ala Tyr Asp Ala Pro Val His Gly Gin 
810 815 820 

ata ggc gtg ggt ttg ttg ata ttg ate acc etc ttc aca etc acc ccg 2851 
He Gly Val Gly Leu Leu He Leu He Thr Leu Phe Thr Leu Thr Pro 
825 830 835 

ggg tat aag acc etc etc ggc cag tgt ctg tgg tgg ttg tgc tat etc 2899 
Gly Tyr Lys Thr Leu Leu Gly Gin Cys Leu Trp Trp Leu Cys Tyr Leu 
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840 845 850 

ctg acc ctg ggg gaa gcc atg att cag gag tgg gta cca ccc atg cag 2947 
Leu Thr Leu Gly Glu Ala Met He Gin Glu Trp Val Pro Pro Met Gin 
855 860 865 

gtg cgc ggc ggc cgc gat ggc ate gcg tgg gcc gtc act ata ttc tgc 2995 
Val Arg Gly Gly Arg Asp Gly He Ala Trp Ala Val Thr He Phe Cys 
870 875 880 885 

ccg ggt gtg gtg ttt gac att acc aaa tgg ctt ttg gcg ttg ctt ggg 3043 
Pro Gly Val Val Phe Asp He Thr Lys Trp Leu Leu Ala Leu Leu Gly 
890 895 900 

cct get tac etc tta agg gcc get ttg aca cat gtg ccg tac ttc gtc 3091 
Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His Val Pro Tyr Phe Val 
905 910 915 

aga get cac get ctg ata agg gta tgc get ttg gtg aag cag etc gcg 3139 
Arg Ala His Ala Leu He Arg Val Cys Ala Leu Val Lys Gin Leu Ala 
920 925 930 

ggg ggt agg tat gtt cag gtg gcg eta ttg gcc ctt ggc agg tgg act 3187 
Gly Gly Arg Tyr Val Gin Val Ala Leu Leu Ala Leu Gly Arg Trp Thr 
935 940 945 

ggc acc tac ate tat gac cac etc aca cct atg teg gac tgg gcc get 3235 
Gly Thr Tyr He Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala 
950 955 960 965 

age ggc ctg cgc gac tta gcg gtc gcc gtg gaa ccc ate ate ttc agt 3283 
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro He He Phe Ser 
970 975 980 

ccg atg gag aag aag gtc ate gtc tgg gga gcg gag acg get gca tgt 3331 
Pro Met Glu Lys Lys Val He Val Trp Gly Ala Glu Thr Ala Ala Cys 
985 990 995 

ggg gac att eta cat gga ctt ccc gtg tec gcc cga etc ggc cag gag 3379 
Gly Asp He Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Gin Glu 
1000 1005 1010 

ate etc etc ggc cca get gat ggc tac acc tec aag ggg tgg aag etc 3427 
He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu 
1015 1020 1025 

ctt get ccc ate act get tat gcc cag caa aca cga ggc etc ctg ggc 3475 
Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
1030 1035 1040 1045 
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gcc ata gtg gtg agt atg acg ggg cgt gac agg aca gaa cag gcc ggg 
Ala He Val Val Ser Met Thr Gly Arg Asp Arg Thr Glu Gin Ala Gly 
1050 1055 1060 



3523 



gaa gtc caa ate ctg tec aca gtc tct cag tec ttc etc gga aca ace 
Glu Val Gin He Leu Ser Thr Val Ser Gin Ser Phe Leu Gly Thr Thr 
1065 1070 1075 



3571 



ate teg ggg gtt ttg tgg act gtt tac cac gga get ggc aac aag act 
He Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr 
1080 1085 1090 



3619 



eta gcc ggc tta egg ggt ccg gtc acg cag atg tac teg agt get gag 
Leu Ala Gly Leu Arg Gly Pro Val Thr Gin Met Tyr Ser Ser Ala Glu 
1095 1100 1105 



3667 



ggg gac ttg gta ggc tgg ccc age ccc cct ggg acc aag tct ttg gag 
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu 
1110 1115 1120 1125 



3715 



ccg tgc aag tgt gga gcc gtc gac eta tat ctg gtc acg egg aac get 
Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala 
1130 1135 1140 



3763 



gat gtc ate ccg get egg aga cgc ggg gac aag egg gga gca ttg etc 
Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu 
1145 1150 1155 



3811 



tec ccg aga ccc att teg acc ttg aag ggg tec teg ggg ggg ccg gtg 
Ser Pro Arg Pro He Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val 
1160 1165 1170 



3859 



etc tgc cct agg ggc cac gtc gtt ggg etc ttc cga gca get gtg tgc 
Leu Cys Pro Arg Gly His Val Val Gly Leu Phe Arg Ala Ala Val Cys 
1175 1180 1185 



3907 



tct egg ggc gtg gcc aaa tec ate gat ttc ate ccc gtt gag aca etc 3955 
Ser Arg Gly Val Ala Lys Ser He Asp Phe He Pro Val Glu Thr Leu 
1190 1195 1200 1205 



gac gtt gtt aca agg tct ccc act ttc agt gac aac age acg cca ccg 
Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro 
1210 1215 1220 



4003 



get gtg ccc cag acc tat cag gtc ggg tac ttg cat get cca act ggc 4051 
Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu His Ala Pro Thr Gly 
1225 1230 1235 



agt gga aag age acc aag gtc cct gtc gcg tat gcc gcc cag ggg tac 4099 
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gin Gly Tyr 
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1240 1245 1250 

aaa gta eta gtg ctt aac ccc teg gta get gee ace ctg ggg ttt ggg 4147 
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
1255 1260 1265 

gcg tac eta tec aag gca cat ggc ate aat ccc aac att agg act gga 4195 
Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro Asn He Arg Thr Gly 
1270 1275 1280 1285 

gtc agg acc gtg atg acc ggg gag gee ate acg tac tec aca tat ggc 4243 
Val Arg Thr Val Met Thr Gly Glu Ala He Thr Tyr Ser Thr Tyr Gly 
1290 1295 1300 

aaa ttt etc gee gat ggg ggc tgc get age ggc gee tat gac ate ate 4291 
Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly Ala Tyr Asp He He 
1305 1310 1315 

ata tgc gat gaa tgc cac get gtg gat get acc tec att etc ggc ate 4339 
He Cys Asp Glu Cys His Ala Val Asp Ala Thr Ser He Leu Gly He 
1320 1325 1330 

gga acg gtc ctt gat caa gca gag aca gee ggg gtc aga eta act gtg 4387 
Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Val Arg Leu Thr Val 
1335 1340 1345 

ctg get acg gee aca ccc ccc ggg tea gtg aca acc ccc cat ccc gat 4435 
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asp 
1350 1355 1360 1365 

ata gaa gag gta ggc etc ggg egg gag ggt gag ate ccc ttc tat ggg 4483 
He Glu Glu Val Gly Leu Gly Arg Glu Gly Glu He Pro Phe Tyr Giy 
1370 1375 1380 

agg gcg att ccc eta tec tgc ate aag gga ggg aga cac ctg att ttc 4531 
Arg Ala He Pro Leu Ser Cys He Lys Gly Gly Arg His Leu He Phe 
1385 1390 1395 

tgc cac tea aag aaa aag tgt gac gag etc gcg gcg gee ctt egg ggc 4579 
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly 
1400 1405 1410 

atg ggc ttg aat gee gtg gca tac tat aga ggg ttg gac gtc tec ata 4627 
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser He 
1415 1420 1425 

ata cca get cag gga gat gtg gtg gtc gtc gee acc gac gee etc atg 4675 
He Pro Ala Gin Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
1430 1435 1440 1445 
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acg ggg tac act gga gac ttt gac tec gtg ate gac tgc aat gta gcg 4723 
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Val Ala 
1450 1455 1460 

gtc acc caa get gtc gac ttc age ctg gac ccc acc ttc act ata ace 4771 
Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Thr 
1465 1470 1475 

aca cag act gtc cca caa gac get gtc tea cgc agt cag cgc cgc ggg 4819 
Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
1480 1485 1490 

cgc aca ggt aga gga aga cag ggc act tat agg tat gtt tec act ggt 4867 
Arg Thr Gly Arg Gly Arg Gin Gly Thr Tyr Arg Tyr Val Ser Thr Gly 
1495 1500 1505 

gaa cga gee tea gga atg ttt gac agt gta gtg ctt tgt gag tgc tac 4915 
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr 
1510 1515 1520 1525 

gac gca ggg get gcg tgg tac gat etc aca cca gcg gag acc acc gtc 4963 
Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val 
1530 1535 1540 

agg ctt aga gcg tat ttc aac acg ccc ggc eta ccc gtg tgt caa gac 5011 
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
1545 1550 1555 

cat ctt gaa ttt tgg gag gca gtt ttc acc ggc etc aca cac ata gac 5059 
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His He Asp 
1560 1565 1570 

gee cac ttc etc tec caa aca aag caa gcg ggg gag aac ttc gcg tac 5107 
Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Glu Asn Phe Ala Tyr 
1575 1580 1585 

eta gta gec tac caa get acg gtg tgc gec aga gec aag gee cct ccc 5155 
Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro 
1590 1595 1600 1605 

ccg tec tgg gac gec atg tgg aag tgc ctg gec cga etc aag cct acg 5203 
Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala Arg Leu Lys Pro Thr 
1610 1615 1620 

ctt gcg ggc ccc aca cct etc ctg tac cgt ttg ggc cct att acc aat 5251 
Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro He Thr Asn 
1625 1630 1635 

gag gtc acc etc aca cac cct ggg acg aag tac ate gec aca tgc atg 5299 
Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr He Ala Thr Cys Met 
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1640 



1645 



1650 



caa get gac ctt gag gtc atg acc age acg tgg gtc eta get gga gga 
Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly 
1655 1660 1665 



5347 



gtc ctg gca gee gtc gec gca tat tgc ctg gcg act gga tgc gtt tec 
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser 
1670 1675 1680 1685 



5395 



ate ate ggc cgc ttg cac gtc aac cag cga gtc gtc gtt gcg ccg gat 
He He Gly Arg Leu His Val Asn Gin Arg Val Val Val Ala Pro Asp 
1690 1695 1700 



5443 



aag gag gtc ctg tat gag get ttt gat gag atg gag gaa tgc gec tct 
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser 
1705 1710 1715 



5491 



agg gcg get etc ate gaa gag ggg cag egg ata gec gag atg ttg aag 
Arg Ala Ala Leu He Glu Glu Gly Gin Arg He Ala Glu Met Leu Lys 
1720 1725 1730 



5539 



tec aag ate caa ggc ttg ctg cag cag gee tct aag cag gec cag gac 
Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser Lys Gin Ala Gin Asp 
1735 1740 1745 



5587 



ata caa ccc get atg cag get tea tgg ccc aaa gtg gaa caa ttt tgg 
He Gin Pro Ala Met Gin Ala Ser Trp Pro Lys Val Glu Gin Phe Trp 
1750 1755 1760 1765 



5635 



gec aga cac atg tgg aac ttc att age ggc ate caa tac etc gca gga 
Ala Arg His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
1770 1775 1780 



5683 



ttg tea aca ctg cca ggg aac ccc gcg gtg get tec atg atg gca ttc 5731 
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe 
1785 1790 1795 

agt gec gec etc acc agt ccg ttg teg acc agt acc acc ate ctt etc 5779 
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr He Leu Leu 
1800 1805 1810 



aac ate atg gga ggc tgg tta gcg tec cag ate gca cca ccc gcg ggg 5827 
Asn He Met Gly Gly Trp Leu Ala Ser Gin He Ala Pro Pro Ala Gly 
1815 1820 1825 

gec acc ggc ttt gtc gtc agt ggc ctg gtg ggg get gec gtg ggc age 5875 
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser 
1830 1835 1840 1845 
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ata ggc ctg ggt aag gtg ctg gtg gac ate ctg gca gga tat ggt gcg 
lie Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
1850 1855 1860 



5923 



ggc att teg ggg gee etc gtc gca ttc aag ate atg tct ggc gag aag 5971 
Gly He Ser Gly Ala Leu Val Ala Phe Lys He Met Ser Gly Glu Lys 
1865 1870 1875 

ccc tct atg gaa gat gtc ate aat eta ctg cct ggg ate ctg tct ccg 6019 
Pro Ser Met Glu Asp Val He Asn Leu Leu Pro Gly He Leu Ser Pro 
1880 1885 1890 



gga gee ctg gtg gtg ggg gtc ate tgc gcg gee att ctg cgc cgc cac 
Gly Ala Leu Val Val Gly Val He Cys Ala Ala He Leu Arg Arg His 
1895 1900 1905 



6067 



gtg gga ccg ggg gag ggc gcg gtc caa tgg atg aac agg ctt att gee 
Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
1910 1915 1920 1925 



6115 



ttt get tec aga gga aac cac gtc gee cct act cac tac gtg acg gag 
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu 
1930 1935 1940 



6163 



teg gat gcg teg cag cgt gtg acc caa eta ctt ggc tct ctt act ata 
Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu Gly Ser Leu Thr He 
1945 1950 1955 



6211 



acc age eta etc aga aga etc cac aat tgg ata act gag gac tgc ccc 
Thr Ser Leu Leu Arg Arg Leu His Asn Trp He Thr Glu Asp Cys Pro 
1960 1965 1970 



6259 



ate cca tgc tec gga tec tgg etc cgc gac gtg tgg gac tgg gtt tgc 
He Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys 
1975 1980 1985 



6307 



acc ate ttg aca gac ttc aaa aat tgg ctg acc tct aaa ttg ttc ccc 
Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro 
1990 1995 2000 2005 



6355 



aag ctg ccc ggc etc ccc ttc ate tct tgt caa aag ggg tac aag ggt 
Lys Leu Pro Gly Leu Pro Phe He Ser Cys Gin Lys Gly Tyr Lys Gly 
2010 2015 2020 



6403 



gtg tgg gee ggc act ggc ate atg acc acg cgc tgc cct tgc ggc gee 6451 
Val Trp Ala Gly Thr Gly He Met Thr Thr Arg Cys Pro Cys Gly Ala 
2025 2030 2035 

aac ate tct ggc aat gtc cgc ctg ggc tct atg agg ate aca ggg cct 6499 
Asn He Ser Gly Asn Val Arg Leu Gly Ser Met Arg He Thr Gly Pro 
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2040 2045 2050 

aaa acc tgc atg aac acc tgg cag ggg acc ttt cct ate aat tgc tac 6547 
Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe Pro He Asn Cys Tyr 
2055 2060 2065 

acg gag ggc cag tgc gcg ccg aaa ccc ccc acg aac tac aag acc gec 6595 
Thr Glu Gly Gin Cys Ala Pro Lys Pro Pro Thr Asn Tyr Lys Thr Ala 
2070 2075 2080 2085 

ate tgg agg gtg gcg gec teg gag tac gcg gag gtg acg cag cat ggg 6643 
He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gin His Gly 
2090 2095 2100 

teg tac tec tat gta aca gga ctg acc act gac aat ctg aaa att cct 6691 
Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp Asn Leu Lys He Pro 
2105 2110 2115 

tgc caa eta cct tct cca gag ttt ttc tec tgg gtg gac ggt gtg cag 6739 
Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gin 
2120 2125 2130 

ate cat agg ttt gca ccc aca cca aag ccg ttt ttc egg gat gag gtc 6787 
He' His Arg Phe Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val 
2135 2140 2145 

teg ttc tgc gtt ggg ctt aat tec tat get gtc ggg tec cag ctt ccc 6835 
Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gin Leu Pro 
2150 2155 2160 2165 

tgt gaa cct gag ccc gac gca gac gta ttg agg tec atg eta aca gat 6883 
Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met Leu Thr Asp 
2170 2175 2180 

ccg ccc cac ate acg gcg gag act gcg gcg egg cgc ttg gca egg gga 6931 
Pro Pro His He Thr Ala Glu Thr Ala Ala Arg Arg Leu Ala Arg Gly 
2185 2190 2195 

tea cct cca tct gag gcg age tec tea gtg age cag eta tea gca ccg 6979 
Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser Gin Leu Ser Ala Pro 
2200 2205 2210 

teg ctg egg gec acc tgc acc acc cac age aac acc tat gac gtg gac 7027 
Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn Thr Tyr Asp Val Asp 
2215 2220 2225 

atg gtc gat gee aac ctg etc atg gag ggc ggt gtg get cag aca gag 7075 
Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly Val Ala Gin Thr Glu 
2230 2235 2240 2245 
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cct gag tec agg gtg ccc gtt ctg gac ttt etc gag cca atg gee gag 7123 
Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu Glu Pro Met Ala Glu 
2250 2255 2260 

gaa gag age gac ctt gag ccc tea ata cca teg gag tgc atg etc ccc 7171 
Glu Glu Ser Asp Leu Glu Pro Ser He Pro Ser Glu Cys Met Leu Pro 
2265 2270 2275 

agg age ggg ttt cca egg gee tta ccg get tgg gca egg cct gac tac 7219 
Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr 
2280 2285 2290 

aac ccg ccg etc gtg gaa teg tgg agg agg cca gat tac caa ccg ccc 7267 
Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro Asp Tyr Gin Pro Pro 
2295 2300 2305 

ace gtt get ggt tgt get etc ccc ccc ccc aag aag gee ccg acg cct 7315 
Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Ala Pro Thr Pro 
2310 2315 2320 2325 

ccc cca agg aga cgc egg aca gtg ggt ctg age gag age ace ata tea 7363 
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr He Ser 
2330 2335 2340 

gaa gee etc cag caa ctg gee ate aag acc ttt ggc cag ccc ccc teg 7411 
Glu Ala Leu Gin Gin Leu Ala He Lys Thr Phe Gly Gin Pro Pro Ser 
2345 2350 2355 

age ggt gat gca ggc teg tec acg ggg gcg ggc gee gee gaa tec ggc 7459 
Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly 
2360 2365 2370 

ggt ccg acg tec cct ggt gag ccg gec ccc tea gag aca ggt tec gee 7507 
Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser Ala 
2375 2380 2385 

tec tct atg ccc ccc etc gag ggg gag cct gga gat ccg gac ctg gag 7555 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu 
2390 2395 2400 2405 

tct gat cag gta gag ctt caa cct ccc ccc cag ggg ggg ggg gta get 7603 
Ser Asp Gin Val Glu Leu Gin Pro Pro Pro Gin Gly Gly Gly Val Ala 
2410 2415 2420 

ccc ggt teg ggc teg ggg tct tgg tct act tgc tec gag gag gac gat 7651 
Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp 
2425 2430 2435 

acc acc gtg tgc tgc tec atg tea tac tec tgg acc ggg get eta ata 7699 
Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu He 
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2440 2445 2450 

act ccc tgt age ccc gaa gag gaa aag ttg cca ate aac cct ttg agt 7747 
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro He Asn Pro Leu Ser 
2455 2460 2465 

aac teg ctg ttg cga tac cat aac aag gtg tac tgt aca aca tea aag 7795 
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys 
2470 2475 2480 2485 

age gec tea cag agg get aaa aag gta act ttt gac agg acg caa gtg 7843 
Ser Ala Ser Gin Arg Ala Lys Lys Val Thr Phe Asp Arg Thr Gin Val 
2490 2495 2500 

etc gac gec cat tat gac tea gtc tta aag gac ate aag eta gcg get 7891 
Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp He Lys Leu Ala Ala 
2505 2510 2515 

tec aag gtc age gca agg etc etc acc ttg gag gag gcg tgc cag ttg 7939 
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gin Leu 
2520 2525 2530 

act cca ccc cat tct gca aga tec aag tat gga ttc ggg gec aag gag 7987 
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu 
2535 2540 2545 

gtc cgc age ttg tec ggg agg gec gtt aac cac ate aag tec gtg tgg 8035 
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His He Lys Ser Val Trp 
2550 2555 2560 2565 

aag gac etc ctg gaa gac cca caa aca cca att ccc aca acc ate atg 8083 
Lys Asp Leu Leu Glu Asp Pro Gin Thr Pro He Pro Thr Thr He Met 
2570 2575 2580 

gec aaa aat gag gtg ttc tgc gtg gac ccc gec aag ggg ggt aag aaa 8131 
Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys 
2585 2590 2595 

cca get cgc etc ate gtt tac cct gac etc ggc gtc egg gtc tgc gag 8179 
Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 
2600 2605 2610 

aaa atg gec etc tat gac att aca caa aag ctt cct cag gcg gta atg 8227 
Lys Met Ala Leu Tyr Asp He Thr Gin Lys Leu Pro Gin Ala Val Met 
2615 2620 2625 

gga get tec tat ggc ttc cag tac tec cct gec caa egg gtg gag tat 8275 
Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Glu Tyr 
2630 2635 2640 2645 
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etc ttg aaa gca tgg gcg gaa aag aag gac ccc atg ggt ttt teg tat 
Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly Phe Ser Tyr 
2650 2655 2660 



8323 



gat ace cga tgc ttc gac tea acc gtc act gag aga gac ate agg acc 
Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp He Arg Thr 
2665 2670 2675 



8371 



gag gag tec ata tac cag gee tgc tec ctg ccc gag gag gee cgc act 
Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr 
2680 2685 2690 



8419 



gee ata cac teg ctg act gag aga ctt tac gta gga ggg ccc atg ttc 
Ala He His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe 
2695 2700 2705 



8467 



aac age aag ggt caa acc tgc ggt tac aga cgt tgc cgc gee age ggg 
Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly 
2710 2715 2720 2725 



8515 



gtg eta acc act age atg ggt aac acc ate aca tgc tat gtg aaa gec 
Val Leu Thr Thr Ser Met Gly Asn Thr He Thr Cys Tyr Val Lys Ala 
2730 . 2735 2740 



8563 



eta gcg gec tgc aag get gcg ggg ata gtt gcg ccc aca atg ctg gta 
Leu Ala Ala Cys Lys Ala Ala Gly He Val Ala Pro Thr Met Leu Val 
2745 2750 2755 



8611 



tgc ggc gat gac eta gta gtc ate tea gaa age cag ggg act gag gag 
Cys Gly Asp Asp Leu Val Val He Ser Glu Ser Gin Gly Thr Glu Glu 
2760 2765 2770 



8659 



gac gag egg aac ctg aga gee ttc acg gag gec atg acc agg tac tct 
Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser 
2775 2780 2785 



8707 



gec cct cct ggt gat ccc ccc aga ccg gaa tat gac ctg gag eta ata 
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu He 
2790 2795 2800 2805 



8755 



aca tec tgt tec tea aat gtg tct gtg gcg ttg ggc ccg egg ggc cgc 
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Arg Gly Arg 
2810 2815 2820 



8803 



cgc aga tac tac ctg acc aga gac cca acc act cca etc gee egg get 
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala 
2825 2830 2835 



8851 



gec tgg gaa aca gtt aga cac tec cct ate aat tea tgg ctg gga aac 8899 
Ala Trp Glu Thr Val Arg His Ser Pro He Asn Ser Trp Leu Gly Asn 
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2840 2845 2850 

ate ate cag tat get cca ace ata tgg gtt cgc atg gtc eta atg aca 8947 
He He Gin Tyr Ala Pro Thr He Tip Val Arg Met Val Leu Met Thr 
2855 2860 2865 

cac ttc ttc tec att etc atg gtc caa gac acc ctg gac cag aac etc 8995 
His Phe Phe Ser He Leu Met Val Gin Asp Thr Leu Asp Gin Asn Leu 
2870 2875 2880 2885 

aac ttt gag atg tat gga tea gta tac tec gtg aat cct ttg gac ctt 9043 
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro Leu Asp Leu 
2890 2895 2900 

cca gee ata att gag agg tta cac ggg ctt gac gee ttt tct atg cac 9091 
Pro Ala He He Glu Arg Leu His Gly Leu Asp Ala Phe Ser Met His 
2905 2910 2915 

aca tac tct cac cac gaa ctg acg egg gtg get tea gee etc aga aaa 9139 
Thr Tyr Ser His His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys 
2920 2925 2930 

ctt ggg gcg cca ccc etc agg gtg tgg aag agt egg get cgc gca gtc 9187 
Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser Arg Ala Arg Ala Val 
2935 2940 2945 

agg gcg tec etc ate tec cgt gga ggg aaa gcg gee gtt tgc ggc cga 9235 
Arg Ala Ser Leu He Ser Arg Gly Gly Lys Ala Ala Val Cys Gly Arg 
2950 2955 2960 2965 

tat etc ttc aat tgg gcg gtg aag acc aag etc aaa etc act cca ttg 9283 
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu 
2970 2975 2980 

ccg gag gcg cgc eta ctg gac tta tec agt tgg ttc acc gtc ggc gee 9331 
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala 
2985 2990 2995 

ggc ggg ggc gac att ttt cac age gtg teg cgc gee cga ccc cgc tea 9379 
Gly Gly Gly Asp He Phe His Ser Val Ser Arg Ala Arg Pro Arg Ser 
3000 3005 3010 

tta etc ttc ggc eta etc eta ctt ttc gta ggg gta ggc etc ttc eta 9427 
Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu 
3015 3020 3025 

etc ccc get egg tag agcggcacac actaggtaca ctccatagct aactgttcct 9482 

Leu Pro Ala Arg 

3030 
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tttttttttt tttttttttt tttttttttt 

tctttcttcc cttctcatct tattctactt 
tcacggctag ctgtgaaagg tccgtgagcc 
tctctgcaga tcatgt 



tttttttttt ttcttttttt tttttttccc 9542 
tctttcttgg tggctccatc ttagccctag 9602 
gcatgactgc agagagtgcc gtaactggtc 9662 

9678 



<210> 4 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 



<400> 4 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

15 10 15 

Arg Arg Pro Glu Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 .60 

He Pro Lys Asp Arg Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly 
65 70 75 80 

Arg Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val He Asp Thr Leu Thr Cys ' 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Val Val Gly Ala Pro Leu 

130 135 140 

Ser Gly Ala Ala Arg Ala Val Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser He 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys He Thr Val Pro Val Ser Ala Ala 

180 185 190 

Gin Val Lys Asn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser 

195 200 205 

Asn Asp Ser He Thr Trp Gin Leu Glu Ala Ala Val Leu His Val Pro 

210 215 220 

Gly Cys Val Pro Cys Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val 
225 230 235 240 

Pro Val Ser Pro Asn Met Ala Val Arg Gin Pro Gly Ala Leu Thr Gin 

245 250 255 

Gly Leu Arg Thr His He Asp Met Val Val Met Ser Ala Thr Phe Cys 

260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
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OOA 

280 




285 




Gin Val 


rne 


T 1 

lie 


Val 


ber 


Pro 


Gin Tyr His 


Trp Phe Val 


pi., pi,, A 

Gin Glu Lys 


OA A 

290 










OAC 

295 






300 




Asn Lys 


ber 


T 1 

lie 


lyr 


Pro 


A 1 

Gly 


Ihr He Ihr 


Gly His Arg 


Met Ala lrp 


one 

305 








Ota 

310 






315 




OOA 

320 


Asp Met 


Met 


Met 


Asn 


Trp 


ber 


Pro Ihr Ala 


Thr Met He 


Leu Ala lyr 








ooc 

325 






OOA 

330 






ooc 

335 


Val Met 


Arg 


Val 


Pro 


A 1 

Glu 


T 7 _ 1 

Val 


Tl „ T 1 „ A 

He lie Asp 


He Val Ser 


A1 A 1 « TT* n 

Gly Ala His 






340 








O A C 

345 






350 


lrp Gly 


Val 


Met 


rne 


A 1 

Gly 


T 

Leu 


Ala Tyr Phe 


Ser Met Gin 


A 1 A 1 ~ T 

Gly Ala Trp 




355 










OTA 

360 




365 




A 1 A T 

Ala Lys 


T 7 1 

Val 


T 1 

lie 


T 7 _ 1 

Val 


lie 


Leu 


Leu Leu Ala 


Ala Gly Val 


A A 1 A 1 

Asp Ala Gly 


OTA 

370 










375 






380 




lhr Ihr 


Ihr 


Val 


Gly 


Gly 


A 1 

Ala 


T7^ I A 1 _ A 

Val Ala Arg 


Ser Thr Asn 


T 7 _ 1 Tl A 1 ^ 

Val He Ala 


385 








OAA 

390 






395 




A A A 

400 


Gly Val 


rne 


ber 


His 


A 1 

Gly 


Pro 


A 1 — Pl„ A . 

Gin Gin Asn 


He Gin Leu 


T 1 _ A TT— 

lie Asn Thr 








405 






J 1 A 

410 






415 


Asn Gly 


ber 


Trp 


Tjr : _ 

His 


T 1 „ 

lie 


Asn 


Arg Ihr Ala 


Leu Asn Cys 


Asn Asp Ser 






















Leu Asn 


Thr 


Gly 


rhe 


T Mm 

Leu 


Ala 


A 1 T . ] 

Ala Leu Phe 


Tyr Thr Asn 


Arg Phe Asn 




435 










A A A 

440 




445 




Ser Ser 


Gly 


Lys 


rro 


Gly 


Arg 


Leu ber Ala 


Cys Arg Asn 


lie Glu Ala 


450 










A r— i— 

455 










Phe Arg 


lie 


Gly 


Trp 


A 1 

Gly 


Thr 


Leu Gin Tyr 


Glu Asp Asn 


Val Thr Asn 


vl AC 

465 








470 






475 




yl O A 

480 


Pro Glu 


Asp 


Met 


Arg 


Pro 


Tyr 


A__— T*™ TT- — 

Lys lrp His 


Tyr Pro Pro 


Lys Pro Cys 








485 






A AA 

490 






A AC 

495 


Gly Val 


Val 


Pro 


A 1 « 

Ala 


Arg 


ber 


Val Lys Gly 


Pro Val Tyr 


Cys Phe Thr 






500 








C AC 

505 






CIA 

510 


Pro Ser 


rro 


Val 


Val 


Val 


Gly 


Ihr Ihr Asp 


Arg Arg Gly 


Val Pro Ihr 




c i c 

515 










COA 

520 




525 




lyr inr 


lrp 


A I 

Gly 


Glu 


Asn 


Glu 


Ihr Asp Val 


Phe Leu Leu 


Asn ber Inr 


C OA 

530 










c oc 

535 






540 




Arg Pro 


rro 


Gin 


Gly 


ber 


Trp 


r»u ~ A 1 A 

Phe Gly Lys 


Thr 


Trp Met 


Asn ber Thr 


545 








CCA 

550 






555 




560 


Gly Pne 


Inr 


T ___ 

Lys 


Ihr 


Lys 


Gly 


Ala Pro Pro 


Cys 


Arg Thr 


Arg Ala Asp 








C A C 

565 






570 






C^7C 

575 


rhe Asn 


A 1 ~ 

Ala 


ber 


Inr 


Asp 


Leu 


Leu Cys Pro 


Thr 


Asp Cys 


Phe Arg Lys 






rro A 

580 








coc 

585 






C AA 

590 


TT * _ T}— _ 

His rro 


Asp 


A 1 

Ala 


Inr 


Tyr 


He 


T A A 1 

Lys Lys Gly 


Ser 


Gly Pro 


lrp Leu ihr 




C AC 

595 










r aa 

600 




605 




rro Lys 


Lys 


Leu 


Val 


TT ' 

His 


Tyr 


Pro Tyr Arg 


Leu 


Trp His 


lyr Pro Lys 


ATA 

610 










615 






620 




Inr Val 


Asn 


rne 


Ihr 


lie 


rhe 


Lys He Arg 


Met 


Tyr Val 


A 1 A 1 TT^ 1 

Gly Gly Val 


coc 








630 






635 




A >1 A 

640 


uiu His 


Arg 


Leu 


inr 


A 1 o 

Aia 


Aia 


Lys Asn rne 


Thr 


Arg Gly 


Asp Arg Lys 








645 






650 






655 


Asp Leu 


Glu 


Asp 


Arg 


Asp 


Arg 


Ser Gin Leu 


Ser 


Pro Leu 


Leu His Ser 






660 








665 






670 


Thr Thr 


Glu 


Trp 


Ala 


He 


Leu 


Pro Cys Thr 


Tyr 


Ser Asp 


Leu Pro Ala 
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o75 








680 


685 


Leu 


Cam- 
ber 


lnr 


PI XT 

lily 


T Ml 

Leu 


T Ml 

Leu 


tt;_ T m i Ui« P1t% A 

His Leu His bin Asn 


Tift 17ft 1 A ft*-. TJft 1 A1- 

lie Vai Asp Vai bin 




oyu 










CAP 

o9o 


HAA 

700 


lyr 


Met 


lyr 


PI TT 

bly 


T Ml 

Leu 


ber 


rro Ala lie lnr Lys 


TV— TTft 1 \7ft 1 A —ft. T*— ^ 

lyr Vai Vai Arg Irp 


/Ob 










11 A 

710 


715 


720 


blU 


irp 


Vai 


Vai 


T mi 

Leu 


Leu 


Phe Leu Leu Leu Ala 


Asp Ala Arg Vai Cys 










TOE 

725 




730 


735 


Ala 


Lys 


T ™, 

Leu 


Irp 


Met 


T Ml 

Leu 


t i ^ t mi t mi p l T» n„ 

lie Leu Leu bly bin 


Alft Pl„ Alft Alft Tft., 

Ala biu Ala Ala Leu 








1A A 

74U 






HA C 

745 


750 


blu 


Lys 


Leu 


vai 


Vai 


T Ml 

Leu 


His Ala Ala ber Ala 


Ala Asn Lys His bly 






arr 
(DO 








760 


765 


Leu 


Leu 


lyr 


rne 


Ala 


lie 


T)!,^ T3U/-v TTft 1 A 1 ft A 1 « 

rne rne Vai Ala Ala 


T*.« TJ : ft. Tift A —ft- P 1 tt 

irp His lie Arg bly 




tta 










hhc 

775 


nOA 

780 


Arg 


vai 


vai 


rro 


T Ml 

Leu 


lnr 


lnr lyr Lys Leu lnr 


A 1 T — . . Tl— ~ ft 

bly Leu Irp rro rne 


nor 
/OD 










*7AA 

790 


795 


OAA 

800 


Lys 


Leu 


Leu 


T mi 

Leu 


Met 


A 1 ft 

Ala 


T , "D««ft A A1 _ A 1 ft 

Leu rro Arg bin Ala 


TV — Alft A A 1 « 

lyr Ala lyr Asp Ala 










QAr 

805 




O 1 A 

810 


ni P 

815 


rro 


Vai 


His 


bly 


bin 


T1 rt 

lie 


P 1 TT TTft 1 P 1 ,T T ft!! T ftl! 

bly Vai bly Leu Leu 


Tift T ft.. Tift TU « Tft.. 

lie Leu He Inr Leu 








O OA 

820 






ooc 

825 


O OA 

830 


rne 


inr 


Leu 


lnr 


rro 


P 1 TT 

bly 


T* TT „ T TTft Tl-» T ft-, i T . 

lyr Lys lnr Leu Leu 


P 1 w V A1 — r*~.r^. T ft.. T— 

bly bin Lys Leu Irp 






ooc 
OOO 








O A A 

840 


845 


TVt% 

Irp 


Leu 


Cys 


Tyr 


Leu 


T Ml 

Leu 


lnr Leu bly blu Ala 


Mft-i- Tift r*i^ r*i.. t ...^ 

Met lie bin blu Irp 




OCA 

850 










855 


or r\ 

860 


Vft. i 
vai 


rro 


rro 


Met 


p i _ 
bin 


Vai 


A t~ft. A 1 ,» A1« A «.ft> A 

Arg bly bly Arg Asp 


PItt Tift Alft f*-ft Alft 

bly He Ala Irp Ala 


obo 










OHA 

870 


875 


OO A 

880 


Vai 


lnr 


T1 n 

lie 


rne 


Ptt~ 
Lys 


rro 


P 1 tt 17ft 1 T7"ft 1 DU ft A ft«* 

bly Vai Vai rne Asp 


lie Thr Lys Trp Leu 










ooc: 

885 




OAA 

890 


o Arr 

895 


Leu 


A 1 « 

Ala 


Leu 


T Ml 

Leu 


bly 


rro 


Ala Tyr Leu Leu Arg 


Alft Alft Tft.. TU — TJ ^ « 

Ala Ala Leu Ihr His 








AAA 

900 






905 


AT A 

910 


vai 


rro 


lyr 


rne 


Vai 


Arg 


Alft U; « Aift T ft.-. Tift 

Ala Mis Ala Leu lie 


A —ft. \7ft 1 Alft T ft.. 

Arg Vai Lys Ala Leu 






A1 c 

915 








AOA 

920 


AOC 

925 


vai 


T ___ 

Lys 


p i 
bin 


Leu 


A 1 #■» 

Ala 


P 1 TT 

bly 


P!tt a —ft. Ttt*. \r**i ai„ 
bly Arg lyr Vai bin 


TjTft 1 A 1 ft T ft.. T ft.. Alft 

Vai Ala Leu Leu Ala 




AOA 

9o0 










Aor 

935 


Ail A 

940 


T All 

Leu 


bly 


Arg 


lrp 


inr 


P 1 TT 

bly 


t^i_ _^ t\,_ m Tift nr.-— a 

lnr lyr lie lyr Asp 


His Leu lnr rro Met 


945 










ACTA 

950 


Apr 

955 


A/* A 

960 


ber 


Asp 


T*-tn 

Trp 


A 1 rt 

Ala 


Ala 


ber 


Gly Leu Arg Asp Leu 


Alft TTft 1 Alft XTft 1 Pl„ 

Ala Vai Ala Vai blu 










Arr 

965 




ATA 

970 


975 


rro 


lie 


T 1 a 

lie 


rne 


ber 


rro 


Ma4. Pin T TTft T TTft T7ft 1 

Met blu Lys Lys Vai 


Tift TTft 1 T^-ft P 1 tt Alft 

lie Vai Irp bly Ala 








AOA 

980 






985 


AAA 

990 


Glu 


Thr 


Ala 


Ala 


Lys 


Gly 


Asp He Leu His Gly 


Leu Pro Vai Ser Ala 






995 








1000 


1005 


Arg 


Leu 


PI TT 

bly 


p i « 
bin 


Glu 


He 


Leu Leu Gly Pro Ala 


Asp Gly Tyr Thr Ser 


1010 








1015 


1020 


Lys Gly 


Irp 


Lys 


Leu 


Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr 


1025 






1030 


1035 


1040 


Arg Gly 


Leu 


Leu 


Gly Ala He Vai Vai Ser Met Thr Gly Arg Asp Arg 








1045 




1050 


1055 


Thr Glu 


Gin 


Ala Gly Glu Vai Gin He Leu Ser Thr Vai Ser Gin Ser 



1060 1065 1070 



Phe Leu Gly Thr Thr He Ser Gly Vai Leu Trp Thr Vai Tyr His Gly 
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1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg Gly Pro Val Thr Gin Met 

1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

Val Thr Arg Asn Ala Asp Val lie Pro Ala Arg Arg Arg Gly Asp Lys 

1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro He Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Val Val Gly Leu Phe 

1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser He Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp 

1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu 

1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro 
1265 1270 1275 1280 

Asn He Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu Ala He Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly 

1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ala Val Asp Ala Thr 

1315 1320 1325 

Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asp He Glu Glu Val Gly Leu Gly Arg Glu Gly Glu 

1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala He Pro Leu Ser Cys He Lys Gly Gly 

1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 

1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser He He Pro Ala Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 

1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro 

1460 1465 1470 

Thr Phe Thr He Thr Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg 
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1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Gin Gly Thr Tyr Arg 

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro 

1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Pro He Thr Asn Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr 

1635 1640 1645 

He Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser He He Gly Arg Leu His Val Asn Gin Arg Val 

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu He Glu Glu Gly Gin Arg He 

1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser 

1730 1735 1740 

Lys Gin Ala Gin Asp He Gin Pro Ala Met Gin Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

Val Glu Gin Phe Trp Ala Arg His Met Trp Asn Phe He Ser Gly He 

1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 1805 

Thr Thr He Leu Leu Asn He Met Gly Gly Trp Leu Ala Ser Gin He 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 

1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser .Gly Ala Leu Val Ala Phe Lys lie 

1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val He Asn Leu Leu Pro 
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1875 



1880 



1885 



Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 

1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu lie Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 

1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 

1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 

1955 1960 1965 

Thr Glu Asp Cys Pro He Pro Cys Ser Gly Ser Trp Leu Arg Asp Val 

1970 1975 1980 

Trp Asp Trp Val Cys Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe He Ser Cys Gin 

2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly He Met Thr Thr Arg 

2020 2025 2030 

Cys Pro Cys Gly Ala Asn He Ser Gly Asn Val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg He Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe 

2050 2055 2060 

Pro He Asn Cys Tyr Thr Glu Gly Gin Cys Ala Pro Lys Pro Pro Thr 
2065 2070 2075 2080 

Asn Tyr Lys Thr Ala lie Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 

2085 2090 2095 

Val Thr Gin His Gly Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp 

2100 2105 2110 

Asn Leu Lys He Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

Val Asp Gly Val Gin He His Arg Phe Ala Pro Thr Pro Lys Pro Phe 

2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Pro His He Thr Ala Glu Thr Ala Ala Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn 

2210 2215 2220 

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly 
2225 2230 2235 2240 

Val Ala Gin Thr Glu Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu 

2245 2250 2255 

Glu Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro Ser He Pro Ser 

2260 2265 2270 

Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp 
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2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro 

2290 2295 2300 

Asp Tyr Gin Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys 
2305 2310 2315 2320 

Lys Ala Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Ser Thr He Ser Glu Ala Leu Gin Gin Leu Ala He Lys Thr Phe 

2340 2345 2350 

Gly Gin Pro Pro Ser Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly 

2355 2360 2365 

Ala Ala Glu Ser Gly Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser 

2370 2375 2380 

Glu Thr Gly Ser Ala Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Ser Asp Gin Val Glu Leu Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Gly Val Ala Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys 

2420 2425 2430 

Ser Glu Glu Asp Asp Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu He Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460, 

He Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Ser Lys Ser Ala Ser Gin Arg Ala Lys Lys Val Thr Phe 

2485 2490 2495 

Asp Arg Thr Gin Val Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp 

2500 2505 2510 

He Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu 

2515 2520 2525 

Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

He Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Pro Gin Thr Pro He 

2565 2570 2575 

Pro Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala 

2580 2585 2590 

Lys Gly Gly Lys Lys Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly 

2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp He Thr Gin Lys Leu 

2610 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gin Arg Val Glu Tyr Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 

2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp He Arg Thr Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro 
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2675 2680 2685 

Glu Glu Ala Arg Thr Ala He His Ser Leu Thr Glu Arg Leu Tyr Val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr He Thr 

2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly He Val Ala 

2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Ser Glu Ser 

2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 

2805 2810 2815 

Gly Pro Arg Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 2825 2830 

Pro Leu Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro He Asn 

2835 2840 2845 

Ser Trp Leu Gly Asn He He Gin Tyr Ala Pro Thr He Trp Val Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser He Leu Met Val Gin Asp Thr 
2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val 

2885 2890 2895 

Asn Pro Leu Asp Leu Pro Ala He He Glu Arg Leu His Gly Leu Asp 

2900 2905 2910 

Ala Phe Ser Met His Thr Tyr Ser His His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu He Ser Arg Gly Gly Lys Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp He Phe His Ser Val Ser Arg 

2995 3000 3005 

Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly 

3010 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 5 
<211> 9674 
<212> DNA 

tiiSE# 2003-3107783 



!$m 2003-329115 



^•—5? : 31/ 



<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341) .. (9442) 
<400> 5 

acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg age aca aat ccc 355 

Met Ser Thr Asn Pro 
1 5 

aaa cct caa aga aaa acc aaa aga aac act aac cgt cgc cca caa gac 403 
Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gin Asp 
10 15 20 

gtt aag ttt ccg ggc ggc ggc cag ate gtt ggc gga gta tac ttg ttg 451 
Val Lys Phe Pro Gly Gly Gly Gin He Val Gly Gly Val Tyr Leu Leu 
25 30 35 

ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg aca agg aag get teg 499 
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Ala Ser 
40 45 50 

gag egg tec cag cca cgt ggg agg cgc cag ccc ate ccc aaa cat egg 547 
Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro He Pro Lys His Arg 
55 60 65 

cgc tec act ggc aag tec tgg ggg aag cca gga tac ccc tgg ccc ctg 595 
Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly Tyr Pro Trp Pro Leu 
70 75 80 85 

tat ggg aat gag ggg etc ggt tgg gca gga tgg etc ctg tec cct cga 643 
Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 
90 95 100 

ggt tec cgt ccc tea tgg ggc ccc aat gac ccc egg cat agg teg cgc 691 
Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro Arg His Arg Ser Arg 
105 110 115 

aat gtg ggt aag gtc ate gat acc eta acg tgc ggc ttt gec gac etc 739 

mum 2003-3107783 



#M 2003-329115 



^-v : - 32/ 



Asn Val Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
120 125 130 

ttg ggg tac gtc ccc gtc gta ggc gcc ccg ctt agt ggc gtt gcc agt 787 
Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu Ser Gly Val Ala Ser 
135 140 145 

get etc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat ttt gca 835 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Phe Ala 
150 155 160 165 

aca ggg aac tta cct ggt tgc tec ttt tct ate ttc ttg ctg gcc eta 883 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
170 175 180 

ctg tec tgc ate act act ccg gtc tct get gtc caa gtg aag aac acc 931 
Leu Ser Cys He Thr Thr Pro Val Ser Ala Val Gin Val Lys Asn Thr 
185 190 195 

age aac gcc tat atg gcg act aac gac tgt tec aat gac age ate act 979 
Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser Asn Asp Ser He Thr 
200 205 210 

tgg cag ctt gag gcc gca gtc etc cat gtc ccc ggg tgc gtc ccg tgc 1027 
Trp Gin Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys 
215 220 . 225 

gag aaa atg ggg aac aca tea egg tgc tgg ata cca gtc tea cca aac 1075 
Glu Lys Met Gly Asn Thr Ser Arg Cys Trp He Pro Val Ser Pro Asn 
230 235 240 245 

gtg get gtg egg cag cct ggc gcc etc acg egg ggc ttg egg acg cac 1123 
Val Ala Val Arg Gin Pro Gly Ala Leu Thr Arg Gly Leu Arg Thr His 
250 255 260 

ate gac atg gtc gtg ttg tec gcc acg etc tgc tec get etc tac gtg 1171 
He Asp Met Val Val Leu Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val 
265 270 275 

ggg gac etc tgt ggc ggg gtg atg etc gcg tec cag atg ttc att gtc 1219 
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ser Gin Met Phe He Val 
280 285 290 

teg ccg cag cac cac tgg ttc gtg cag gaa tgc aat tgc tec ate tac 1267 
Ser Pro Gin His His Trp Phe Val Gin Glu Cys Asn Cys Ser He Tyr 
295 300 305 

cct ggc gcc ate act ggg cac cgt.atg gca tgg gac atg atg atg aac 1315 
Pro Gly Ala He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
310 315 320 325 
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tgg teg ccc acg acc acc atg ate ctg gcg tac gtg atg cgc gtt ccc 1363 
Trp Ser Pro Thr Thr Thr Met He Leu Ala Tyr Val Met Arg Val Pro 
330 335 340 

gag gtc ate ata gac ate att age gga get cac tgg ggc gtc atg ttt 1411 
Glu Val He He Asp He He Ser Gly Ala His Trp Gly Val Met Phe 
345 350 355 

ggc ctg gee tac ttc tct atg cag gga gcg tgg gcg aag gtc gtt gtc 1459 
Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp Ala Lys Val Val Val 
360 365 370 

ate etc ctg ctg gee tct ggg gtg gac gcg tac acc acc acg act ggg 1507 
He Leu Leu Leu Ala Ser Gly Val Asp Ala Tyr Thr Thr Thr Thr Gly 
375 380 385 

age get get ggg cgc act acc agt age ctg gee age gee ttc tec cct 1555 
Ser Ala Ala Gly Arg Thr Thr Ser Ser Leu Ala Ser Ala Phe Ser Pro 
390 395 400 405 

ggc get egg cag aac att cag etc att aat acc aat ggt age tgg cac 1603 
Gly Ala Arg Gin Asn He Gin Leu He Asn Thr Asn Gly Ser Trp His 
410 " 415 420 

ate aac cgc acc gee ctg aat tgc aac gat tec ttg cac acc ggc ttc 1651 
He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu His Thr Gly Phe 
425 430 435 

ttc acg gee ctg ttc tac ate cat aag ttc aac teg teg gga tgt ccc 1699 
Phe Thr Ala Leu Phe Tyr He His Lys Phe Asn Ser Ser Gly Cys Pro 
440 445 450 

gag cgc ctg tec gee tgt cgc aac ate gag gac ttc egg ata gga tgg 1747 
Glu Arg Leu Ser Ala Cys Arg Asn He Glu Asp Phe Arg He Gly Trp 
455 460 465 

ggc gee ctg caa tac gac gac aat gtc acc aat cca gaa gat atg agg 1795 
Gly Ala Leu Gin Tyr Asp Asp Asn Val Thr Asn Pro Glu Asp Met Arg 
470 475 480 485 

cca tat tgc tgg cac tac cca cca aaa cag tgt ggc gta gtc ccc gca 1843 
Pro Tyr Cys Trp His Tyr Pro Pro Lys Gin Cys Gly Val Val Pro Ala 
490 495 500 

ggg acc gtg tgc ggc cca gtg tac tgt ttc acc cct age ccg gtg gta 1891 
Gly Thr Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
505 510 515 

gtg ggc acg acc gat aga ctt gga gtg cct act tac acg tgg gga gag 1939 
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Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr Tyr Thr Trp Gly Glu 
520 525 530 

aat gag aca gat gtc ttc eta ttg aac age ace cga cca ccg teg ggg 1987 
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Ser Gly 
535 540 545 

tea tgg ttt ggc tgc acg tgg atg aac tec. act ggc ttc ace aag acc 2035 
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr 
550 555 560 565 

tgc ggc gca cca ccc tgc cgc act aga get gac ttc aat acc age aca 2083 
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Thr Ser Thr 
. 570 575 580 

gat ctg ttg tgc ccc acg gac tgt ttt aga aaa cat cct gaa gec act 2131 
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr 
585 590 595 

tac ate aaa tgt ggt tec ggg cct tgg etc acg cca aag tgt ctg gtt 2179 
Tyr He Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val 
600 605 610 

gac tac ccc tac agg etc tgg cat tac cct tgc aca gtc aat tac tec 2227 
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Tyr Ser 
615 620 625 

acc ttc aag ate agg atg tat gtg ggg gga gtt gag cac agg etc atg 2275 
Thr Phe Lys He Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Met 
630 635 640 645 

gec gcg tgc aat ttc act cgt ggg gat cgc tgc aac ttg gag gat agg 2323 
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asn Leu Glu Asp Arg 
650 655 660 

gac aga agt caa cag act cct ctg ttg cac tec acc acg gaa tgg gec 2371 
Asp Arg Ser Gin Gin Thr Pro Leu Leu His Ser Thr Thr Glu Trp Ala 
665 670 675 

att ttg ccc tgc tct ttc tea gac ttg ccc get ttg teg act ggt ctt 2419 
He Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu 
680 685 690 

etc cac etc cac caa aat ate gtg gac gta caa tat atg tat ggc ctg 2467 
Leu His Leu His Gin Asn He Val Asp Val Gin Tyr Met Tyr Gly Leu 
695 700 705 

tea cct gee etc aca caa tat ate gtt cga tgg gag tgg gta gta etc 2515 
Ser Pro Ala Leu Thr Gin Tyr He Val Arg Trp Glu Trp Val Val Leu 
710 715 720 725 
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tta ttc ctg etc eta gcg gac gee agg gtc tgc gee tgc ttg tgg atg 2563 
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met 
730 735 740 

etc ate ttg ctg ggc caa gec gaa gca gca ctg gag aag ctg gtc gtc 2611 
Leu He Leu Leu Gly Gin Ala Glu Ala Ala Leu Glu Lys Leu Val Val 
745 750 755 

ttg cac get gcg age gca get age tgc aat ggc ttc ctg tat ttt gtc 2659 
Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly Phe Leu Tyr Phe Val 
760 765 770 

ate ttt etc gtg get get tgg cac ate aag ggt agg gtg gtc ccc ttg 2707 
He Phe Leu Val Ala Ala Trp His He Lys Gly Arg Val Val Pro Leu 
775 780 785 

get get tat tec ctt act ggc ctg tgg ccg ttc tgc eta ctg etc eta 2755 
Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Leu 
790 795 800 805 

gca ctg ccc cag cag get tac gee tat gat gca tct gtg cac gga cag 2803 
Ala Leu Pro Gin Gin Ala Tyr Ala Tyr Asp Ala Ser Val His Gly Gin 
810 815 820 

gtg ggc gcg get ttg eta gta ctg att acc etc ttt aca etc acc ccg 2851 
Val Gly Ala Ala Leu Leu Val Leu He Thr Leu Phe Thr Leu Thr Pro 
825 830 835 

ggg tat aag acc ctt etc age cag tec ctg tgg tgg ttg tgc tat etc 2899 
Gly Tyr Lys Thr Leu Leu Ser Gin Ser Leu Trp Trp Leu Cys Tyr Leu 
840 845 850 

ctg acc ctg gcg gaa acc atg gtc cag gag tgg gca cca tec atg cag 2947 
Leu Thr Leu Ala Glu Thr Met Val Gin Glu Trp Ala Pro Ser Met Gin 
855 860 865 

gcg cgc ggc ggc cgt gat ggc ate ata tgg gec gec acc ata ttt tgc 2995 
Ala Arg Gly Gly Arg Asp Gly He He Trp Ala Ala Thr He Phe Cys 
870 875 880 885 

ccg ggc gta gtg ttt gac ata acc aag tgg etc tta gcg gtg ctt ggg 3043 
Pro Gly Val Val Phe Asp He Thr Lys Trp Leu Leu Ala Val Leu Gly 
890 895 900 

cct ggt tac etc eta aga ggt get ttg acg cgc gtg cca tat ttc gtc 3091 
Pro Gly Tyr Leu Leu Arg Gly Ala Leu Thr Arg Val Pro Tyr Phe Val 
905 910 915 

aga gec cac get ctg ctg aga atg tgc act atg gtg agg cac etc gcg 3139 

£tJiiE# 2003-31 07783 



#11 2003-329115 



36/ 



Arg Ala His Ala Leu Leu Arg Met Cys Thr Met Val Arg His Leu Ala 
920 925 930 

ggg ggt agg tac gtc cag atg gcg eta tta gec ctt ggc agg tgg act 3187 
Gly Gly Arg Tyr Val Gin Met Ala Leu Leu Ala Leu Gly Arg Trp Thr 
935 940 945 

ggc act tac ate tat gac cac etc acc cct atg teg gat tgg get get 3235 
Gly Thr Tyr He Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala 
950 955 960 965 

age ggc ctg egg gac ttg gcg gtc get gtg gag cct ate ate ttc agt 3283 
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro He He Phe Ser 
970 975 980 

ccg atg gag aag aaa gtc ate gtt tgg gga gcg gag acg get gcg tgc 3331 
Pro Met Glu Lys Lys Val He Val Trp Gly Ala Glu Thr Ala Ala Cys 
985 990 995 

ggg gac ate ttg cac gga ctt ccc gtg tec gec cga etc ggt egg gag 3379 
Gly Asp He Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Arg Glu 
1000 1005 1010 

ate etc ctt ggc cca get gat ggc tac acc tec aag ggg tgg aag ctt 3427 
He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu 
1015 . 1020 1025 

etc gec ccc ate acc get tac gec cag cag aca cga ggt etc ttg ggc 3475 
Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
1030 1035 1040 1045 

tct ata gtg gtg age atg acg ggg cgt gac aag aca gaa cag gec ggg 3523 
Ser He Val Val Ser Met Thr Gly Arg Asp Lys Thr Glu Gin Ala Gly 
1050 1055 1060 

gag gtc caa gtc ctg tec aca gtc act cag tec ttc etc gga aca tec 3571 
Glu Val Gin Val Leu Ser Thr Val Thr Gin Ser Phe Leu Gly Thr Ser 
1065 1070 1075 

att teg ggg gtc tta tgg act gtt tac cac gga get ggc aac aag aca 3619 
He Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr 
1080 1085 1090 

eta gec ggc teg egg ggc ccg gtc acg cag atg tac teg age gec gag 3667 
Leu Ala Gly Ser Arg Gly Pro Val Thr Gin Met Tyr Ser Ser Ala Glu 
1095 1100 1105 

ggg gac ttg gtc ggg tgg ccc age cct cct ggg acc aaa tct ttg gag 3715 
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu 
1110 1115 . . 1120 1125 
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ccg tgt acg tgt gga gcg gtc gac ctg tat ttg gtc acg egg aac get 3763 
Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala 
1130 1135 1140 



gat gtc ate ccg get cga aga cgc ggg gac aag egg gga gcg ctg etc 
Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu 
1145 1150 1155 



3811 



tec ccg aga ccc ctt teg acc ttg aag ggg tec teg ggg gga cct gtg 
Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val 
1160 1165 1170 



3859 



ctt tgc cct agg ggc cac get gtc gga ate ttc egg gca get gtg tgc 
Leu Cys Pro Arg Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 
1175 1180 1185 



3907 



tct egg ggt gtg get aag tec ata gat ttc ate ccc gtt gag acg etc 3955 
Ser Arg Gly Val Ala Lys Ser He Asp Phe He Pro Val Glu Thr Leu 
1190 1195 1200 1205 



gac ate gtc acg egg tct ccc acc ttt agt gac aac age aca cca cca 
Asp He Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro 
1210 1215 1220 



4003 



get gtg ccc cag acc tat cag gtg ggg tac ttg cac gee ccc act ggc 
Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu His Ala Pro Thr Gly 
1225 1230 1235 



4051 



agt gga aaa age acc aag gtc ccc gtc gcg tac gee gee cag ggg tat 
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gin Gly Tyr 
1240 1245 1250 



4099 



aaa gtg ctg gtg etc aat ccc teg gtg get gee acc ctg gga ttt ggg 4147 
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
1255 1260 1265 



gcg tac ttg tec aag gca cat ggc ate aac ccc aac att agg act gga 
Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro Asn He Arg Thr Gly 
1270 1275 1280 1285 



4195 



gtc aga act gtg acg acc ggg gag ccc att aca tac tec acg tat ggt 
Val Arg Thr Val Thr Thr Gly Glu Pro He Thr Tyr Ser Thr Tyr Gly 
1290 1295 1300 



4243 



aaa ttc etc gec gat ggg ggc tgc gca ggc ggc gec tat gac ate ate 
Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp He He 
1305 1310 1315 



4291 



ata tgc gat gaa tgc cac tct gtg gat get acc act att etc ggc ate 4339 

tB8E#2 0.0 3-3107783 



#£g 2003-329115 



38/ 



He Cys Asp Glu Cys His Ser Val Asp Ala Thr Thr He Leu Gly He 
1320 1325 1330 

ggg aca gtc ctt gac caa gca gag aca gcc ggg gtc agg eta act gta 4387 
Gly Thr Val Leu Asp Gin Ala Glu Thr. Ala Gly Val Arg Leu Thr Val 
1335 1340 1345 

ctg gcc acg gcc acg ccc ccc ggg teg gtg aca acc ccc cat ccc aat 4435 
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn 
1350 1355 1360 1365 

at a gag gag gta gcc etc gga cag gag ggt gag ate ccc ttc tat ggg 4483 
He Glu Glu Val Ala Leu Gly Gin Glu Gly Glu He Pro Phe Tyr Gly 
1370 1375 1380 

agg gcg ttt ccc ctg tct tac ate aag gga ggg agg cac ttg att ttc 4531 
Arg Ala Phe Pro Leu Ser Tyr He Lys Gly Gly Arg His Leu He Phe 
1385 1390 1395 

tgc cac tea aag aaa aag tgt gac gag etc gca acg gcc ctt egg ggc 4579 
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Thr Ala Leu Arg Gly 
1400 1405 1410 

atg ggc ttg aac get gtg gca tat tac aga ggg ttg gac gtc tec ata 4627 
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser lie 
1415 1420 1425 

ata cca act caa gga gat gtg gtg gtc gtt gcc acc gac gcc etc atg 4675 
He Pro Thr Gin Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
1430 1435 1440 1445 

acg ggg tat act gga gac ttt gac tec gtg ate gac tgc aac gta gcg 4723 
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Val Ala 
1450 1455 1460 

gtc acc cag gcc gta gac ttc age ctg gac ccc acc ttc act ata acc 4771 
Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Thr 
1465 1470 1475 

aca cag act gtc ccg caa gac get gtc tea cgt agt cag cgc cga ggg 4819 
Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
1480 1485 1490 

cgc acg ggt aga gga aga ctg ggc att tat agg tat gtt tec act ggt 4867 
Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg Tyr Val Ser Thr Gly 
1495 1500 1505 

gag cga gcc tea gga atg ttt gac .agt gta gta etc tgt gag tgc tac 4915 
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr 
1510 1515 1520 • 1525 
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gac gca gga get get tgg tat gag etc tea cca gtg gag acg ace gtc 4963 
Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro Val Glu Thr Thr Val 
1530 1535 1540 

agg etc agg gcg tat ttc aac acg cct ggc ttg cct gtg tgc cag gac 5011 
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
1545 1550 1555 

cac ctt gag ttt tgg gag gca gtt ttc acc ggc etc aca cac ata gac 5059 
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His He Asp 
1560 1565 1570 

get cat ttc ctt tec cag aca aag cag teg ggg gaa aat ttc gca tac 5107 
Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly Glu Asn Phe Ala Tyr 
1575 1580 1585 

tta gta gec tat cag gec aca gtg tgc gec agg gec aaa gcg ccc ccc 5155 
Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro 
1590 1595 1600 1605 

ccg tec tgg gac gtc atg tgg aag tgc ttg act cga etc aag ccc acg 5203 
Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr Arg Leu Lys Pro Thr 
1610 1615 1620 

ctt gtg ggc cct aca cct etc ctg tac cgt ttg ggc tct gtt acc aac 5251 
Leu Val Gly'Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ser Val Thr Asn 
1625 1630 1635 

gag gtc acc ctt aca cac ccc gtg aca aaa tac ate gec aca tgc atg 5299 
Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr He Ala Thr Cys Met 
1640 1645 1650 

caa get gac etc gag gtc atg acc age acg tgg gtc ctg get ggg gga 5347 
Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly 
1655 1660 1665 

gtc tta gca gec gtc gec gcg tat tgc tta gcg acc ggg tgt gtt tec 5395 
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser 
1670 1675 1680 1685 

ate att ggc cgt tta cac ate aac cag cga get gtc gtc get ccg gac 5443 
He He Gly Arg Leu His He Asn Gin Arg Ala Val Val Ala Pro Asp 
1690 1695 1700 

aag gag gtc etc tat gag get ttt gat gag atg gag gaa tgt gec tec 5491 
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser 
1705 1710 1715 

aga gcg get etc ctt gaa gag ggg cag egg ata gec gag atg ctg aag 5539 
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Arg Ala Ala Leu Leu Glu Glu Gly Gin Arg He Ala Glu Met Leu Lys 
1720 1725 1730 

tec aag ate caa ggc tta ttg cag caa gec tct aaa cag gee cag gac 5587 
Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser Lys Gin Ala Gin Asp 
1735 1740 1745 

ata caa ccc get gtg caa get teg tgg ccc aag atg gag caa ttc tgg 5635 
He Gin Pro Ala Val Gin Ala Ser Trp Pro Lys Met Glu Gin Phe Trp 
1750 1755 1760 1765 

gec aaa cat atg tgg aac ttc ata age ggc att cag tac etc gca gga 5683 
Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
1770 1775 1780 

ctg tea aca ctg cca ggg aac cct get gtg get tec atg atg gca ttc 5731 
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe 
1785 1790 1795 

age gec gec etc acc agt ccg ttg tea act age ace acc ate ctt ctt 5779 
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr He Leu Leu 
1800 - 1805 1810 

aac att ctg ggg ggc tgg ctg gcg tec caa att gcg cca ccc gcg ggg 5827 
Asn He Leu Gly Gly Trp Leu Ala Ser Gin lie Ala Pro Pro Ala Gly 
1815 1820 1825 

gee act ggc ttt gtt gtc agt ggc ctg gtg gga get get gtt ggc age 5875 
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser 
1830 1835 1840 1845 . 

ata ggc ttg ggt aaa gtg ctg gtg gac ate ctg gca ggg tat ggt gcg 5923 
He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
1850 1855 1860 

ggc att teg ggg gec etc gtc gcg ttt aag ate atg tct ggc gag aag 5971 
Gly He Ser Gly Ala Leu Val Ala Phe Lys He Met Ser Gly Glu Lys 
1865 1870 1875 

ccc tec atg gag gat gtc ate aac ttg ctg cct ggg att ctg tct cca 6019 
Pro Ser Met Glu Asp Val He Asn Leu Leu Pro Gly He Leu Ser Pro 
1880 1885 1890 

ggt get ctg gtg gtg gga gtc ate tgc gcg gee att ctg cgc cgc cat 6067 
Gly Ala Leu Val Val Gly Val He Cys Ala Ala He Leu Arg Arg His 
1895 1900 1905 

gtg gga ccg ggg gaa ggc gcg gtc caa tgg atg aac agg ctt ate gec 6115 
Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
1910 1915 1920 1925 
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ttc get tec aga gga aac cac gtc gee cct act cac tac gtg acg gag 
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu 
1930 1935 1940 



6163 



teg gat gcg teg cag cgt gtc ace caa ctg ctt ggc tct etc act ata 
Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu Gly Ser Leu Thr He 
1945 1950 1955 



6211 



act agt eta etc agg aga ctt cac aac tgg ate act gag gat tgc ccc 
Thr Ser Leu Leu Arg Arg Leu His Asn Trp He Thr Glu Asp Cys Pro 
1960 1965 1970 



6259 



ate cca tgc gee ggc teg tgg etc cgc gat gtg tgg gac tgg gtc tgt 
He Pro Cys Ala Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys 
1975 1980 1985 



6307 



acc ate eta aca gac ttt aag aac tgg ctg acc tec aag ctg ttc cca 
Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro 
1990 1995 2000 2005 



6355 



aag atg cct ggc etc ccc ttt ate tct tgc caa aag ggg tac aag ggc 
Lys Met Pro Gly Leu Pro Phe He Ser Cys Gin Lys Gly Tyr Lys Gly 
2010 2015 2020 



6403 



gtg tgg gec ggc act ggc ate atg acc aca cga tgc ccc tgc ggc gec 
Val Trp Ala Gly Thr Gly He Met Thr Thr Arg Cys Pro Cys Gly Ala 
2025 2030 2035 



6451 



aac ate tct ggc aac gtc cgc ttg ggc tct atg aga ate aca gga ccc 
Asn He Ser Gly Asn Val Arg Leu Gly Ser Met Arg He Thr Gly Pro 
2040 2045 2050 



6499 



aaa acc tgc atg aac acc tgg cag ggg acc ttt cct ate aat tgt tat 
Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe Pro He Asn Cys Tyr 
2055 2060 2065 



6547 



aca gaa ggc cag tgc ttg ccg aaa ccc gcg tta aac ttc aag acc gee 
Thr Glu Gly Gin Cys Leu Pro Lys Pro Ala Leu Asn Phe Lys Thr Ala 
2070 2075 2080 2085 



6595 



ate tgg aga gtg gcg gee tea gag tac gcg gaa gtg acg cag cac gga 
He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gin His Gly 
2090 2095 2100 



6643 



tea tat gee tat ata aca ggg ctg acc act gac aac tta aaa gtc cct 
Ser Tyr Ala Tyr He Thr Gly Leu Thr Thr Asp Asn Leu Lys Val Pro 
2105 2110 2115 



6691 



tgc caa etc ccc tct cca gag ttt ttc tct tgg gtg gac gga gta caa 6739 
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Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gin 
2120 2125 2130 

ate cat agg tec gee ccc aca cca aag ccg ttt ttc egg gat gag gtc 
He His Arg Ser Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val 
2135 2140 2145 

teg ttc age gtt ggg etc aat tea ttt gtc gtc ggg tct cag ctt ccc 
Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val Gly Ser Gin Leu Pro 
2150 2155 2160 2165 

tgt gac cct gag ccc gac act gag gta gtg atg tec atg eta aca gac 
Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met Ser Met Leu Thr Asp 
2170 2175 2180 



cca tec cat ate acg gcg gag get gca gcg egg cgt tta gcg egg ggg 
Pro Ser His He Thr Ala Glu Ala Ala Ala Arg Arg Leu Ala Arg Gly 
2185 2190 2195 

tea ccc cca tct gag gca age tec tea gcg age cag ctg teg gcg cca 
Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro 
2200 2205 2210 

teg ctg cga gee acc tgc ace ace cac ggt agg ace tat gat gtg gac 
Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg Thr Tyr Asp Val Asp 
2215 2220 2225 

atg gtg gat gee aac ctg ttc atg ggg ggc ggc gtg att egg ata gag 
Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly Val He Arg He Glu 
2230 2235 2240 2245 

tct gag tec aaa gtg gtc gtt ctg gac tec etc gac tea atg acc gag 
Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu Asp Ser Met Thr Glu 
2250 2255 2260 

gaa gag ggc gac ctt gag cct tea gta cca teg gag tat atg etc ccc 
Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser Glu Tyr Met Leu Pro 
2265 2270 2275 

agg aag agg ttc cca ccg gee tta ccg get tgg gcg egg cct gat tac 
Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr 
2280 2285 2290 



aac cca ccg ctt gtg gaa teg tgg aag agg cca gat tac caa cca ccc 
Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro Asp Tyr Gin Pro Pro 
2295 2300 2305 

act gtt gcg ggc tgt get etc ccc ccc ccc aaa aag acc ccg acg cct 
Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Thr Pro Thr Pro 
2310 2315 2320 2325 
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cct cca agg aga cgc egg aca gtg ggt ctg age gag age ace ata gga 7363 
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr He Gly 
2330 2335 2340 

gat gee etc caa cag ctg gee ate aag tec ttt ggc cag ccc ccc cca 7411 
Asp Ala Leu Gin Gin Leu Ala He Lys Ser Phe Gly Gin Pro Pro Pro 
2345 2350 2355 

age ggc gat tea ggc ctt tec acg ggg gcg gac gee gee gac tec ggc 7459 
Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp Ala Ala Asp Ser Gly 
2360 2365 2370 

gat egg aca ccc cct gac gag ttg get ctt teg gag aca ggt tct acc 7507 
Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser Glu Thr Gly Ser Thr 
2375 2380 2385 

tec tec atg ccc ccc etc gag ggg gag cct ggg gac cca gac ctg gag 7555 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu 
2390 2395 2400 2405 

cct gag cag gta gag ctt caa cct cct ccc cag ggg ggg gag gca get 7603 
Pro Glu Gin Val Glu Leu Gin Pro Pro Pro Gin Gly Gly Glu Ala Ala 
2410 2415 2420 

ccc ggc teg gac teg ggg tec tgg tct act tgc tec gag gag gat gac 7651 
Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp 
2425 2430 2435 

tec gtc gtg tgc tgc tec atg tea tat tec tgg acc ggg get eta ata 7699 
Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu He 
2440 2445 2450 

act cct tgt age ccc gaa gag gaa aag ttg cca att aac tec ttg age 7747 
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro He Asn Ser Leu Ser 
2455 2460 2465 

aac teg ctg ttg cga tac cat aac aag gta tac tgt act aca tea aag 7795 
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys 
2470 2475 2480 2485 

agt gee tea eta agg get aaa aag gta act ttt gat agg atg caa gtg 7843 
Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe Asp Arg Met. Gin Val 
2490 2495 2500 

etc gac gee tat tat gat tea gtc tta aag gac ate aag eta gcg gee 7891 
Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp He Lys Leu Ala Ala 
2505 2510 2515 

tec aag gtc age gca agg etc etc acc tta gag gag gcg tgc caa ttg 7939 
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Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gin Leu 
2520 2525 2530 

acc cca ccc cac tct gca aga tec aag tat ggg ttt ggg get aag gag 
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu 
2535 2540 2545 

gtc cgc age ttg tec ggg agg gec gtc aac cac ate aag tec gtg tgg 
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His He Lys Ser Val Trp 
2550 2555 2560 2565 

aag gac etc ttg gaa gac tea caa aca cca att cct aca acc ate atg 
Lys Asp Leu Leu Glu Asp Ser Gin Thr Pro He Pro Thr Thr He Met 
2570 2575 2580 

gec aaa aat gag gtg ttc tgc gtg gac ccc gec aag ggg ggt aaa aaa 
Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys 
2585 2590 2595 

cca get cgc ctt ate gtt tac cct gac etc ggc gtc agg gtc tgc gag 
Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 
2600 2605 2610 

aag atg gec ctt tat gat gtc aca caa aag ctt cct cag gcg gtg atg 
Lys Met Ala Leu Tyr Asp Val Thr Gin Lys Leu Pro Gin Ala Val Met 
2615 2620 2625 

ggg get tct tat ggc ttc cag tac tec ccc get cag egg gtg gag ttt 
Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Glu Phe 
2630 2635 2640 2645 

etc ttg aag gca tgg gcg gaa aag aga gac cct atg ggt ttt teg tat 
Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro Met Gly Phe Ser Tyr 
2650 2655 2660 



gat acc cga tgc ttt gac tea acc gtc act gag aga gac ate agg act 
Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp He Arg Thr 
2665 2670 2675 

gag gag tec ata tac cag gec tgc tec tta ccc gag gag gec cga act 
Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr 
2680 2685 2690 

gec ata cac teg ctg act gag aga etc tat gtg gga ggg ccc atg ttc 
Ala He His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe 
2695 2700 2705 

aac age aag ggc cag tec tgc ggg. tac agg cgt tgc cgc gec age ggg 
Asn Ser Lys Gly Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly 
2710 2715 2720 2725 
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gtg ctt acc act agt atg ggg aac acc ate aca tgc tat gta aaa gec 8563 
Val Leu Thr Thr Ser Met Gly Asn Thr He Thr Cys Tyr Val Lys Ala 
2730 2735 2740 

eta gcg get tgc aag get gcg ggg ata att gcg ccc acg atg ctg gta 8611 
Leu Ala Ala Cys Lys Ala Ala Gly He He Ala Pro Thr Met Leu Val 
2745 2750 2755 

tgc ggc gac gac ttg gtc gtc ate tea gaa age cag ggg act gag gag 8659 
Cys Gly Asp Asp Leu Val Val He Ser Glu Ser Gin Gly Thr Glu Glu 
2760 2765 2770 

gac gag egg aac ctg aga gec ttc acg gag get atg acc agg tat tct 8707 
Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser 
2775 2780 2785 

gec cct cct ggt gac ccc ccc aga ccg gaa tat gac ctg gag eta ata 8755 
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu He 
2790 2795 2800 2805 

aca tct tgt tec tea aac gtg tct gtg gca ctt ggc cca cag ggc cgc 8803 
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Gin Gly Arg 
2810 2815 2820 

cgc aga tac tac ctg acc aga gac ccc acc act tea att gec egg get 8851 
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Ser He Ala Arg Ala 
2825 2830 2835 

gec tgg gaa aca gtt aga cac tec cct gtc aat tea tgg ctg gga aac 8899 
Ala Trp Glu Thr Val Arg His Ser Pro Val Asn Ser Trp Leu Gly Asn 
2840 2845 2850 

ate ate cag tac get cca acc ata tgg gtt cgc atg gtc ctg atg aca 8947 
He He Gin Tyr Ala Pro Thr He Trp Val Arg Met Val Leu Met Thr 
2855 2860 2865 

cac ttc ttc tec att etc atg gee cag gac acc eta gac cag aac ctt 8995 
His Phe Phe Ser He Leu Met Ala Gin Asp Thr Leu Asp Gin Asn Leu 
2870 2875 2880 2885 

aac ttt gaa atg tac gga teg gtg tac tec gtg agt cct ctg gac etc 9043 
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Ser Pro Leu Asp Leu 
2890 . 2895 2900 

cca gee ata att gaa agg tta cac ggg ctt gac gee ttc tct ctg cac 9091 
Pro Ala He He Glu Arg Leu His Gly Leu Asp Ala Phe Ser Leu His 
2905 2910 2915 

aca tac act ccc cac gaa ctg acg egg gtg get tea gee etc aga aaa 9139 
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Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys 
2920 2925 2930 

ctt ggg gcg cca ccc etc aga gcg tgg aag agt egg gcg cgt gca gtt 9187 
Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser Arg Ala Arg Ala Val 
2935 2940 2945 

agg gcg tec etc ate tec cgt ggg ggg agg gcg gec gtt tgc ggt egg 9235 
Arg Ala Ser Leu He Ser Arg Gly Gly Arg Ala Ala Val Cys Gly Arg 
2950 2955 2960 2965 

tac etc ttc aac tgg gcg gtg aag acc aag etc aaa etc act cct ttg 9283 
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu 
2970 2975 2980 

ccg gag gca cgc etc ctg gat ttg tec agt tgg ttt acc gtc ggc gec 9331 
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala 
2985 2990 2995 

ggc ggg ggc gac att tat cac age gtg teg cgt gec cga ccc cgc eta 9379 
Gly Gly Gly Asp He Tyr His Ser Val Ser Arg Ala Arg Pro Arg Leu 
3000 3005 3010 

tta etc ctt age eta etc eta ctt tct gta ggg gta ggc etc ttc eta 9427 
Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly Val Gly Leu Phe Leu 
3015 3020 3025 

etc ccc get cga tag agcggcacac attagctaca ctccatagct aactgttcct 9482 

Leu Pro Ala Arg 

3030 

tttttttttt tttttttttt tttttttttt tttttttctt tttttttttt tttccctctt 9542 

tcttcccttc tcatcttatt ctactttctt tcttggtggc tccatcttag ccctggtcac 9602 

ggctagctgt gaaaggtccg tgagcegcat gaetgeagag agtgccgtaa ctggtctctc 9662 

tgcagatcat gt 9674 



<210> 6 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 6 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 
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Leu His Thr Gly Phe Phe Thr Ala Leu Phe Tyr He His Lys Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ser Ala Cys Arg Asn He Glu Asp 

450 455 460 

Phe Arg He Gly Trp Gly Ala Leu Gin Tyr Asp Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Gin Cys 

485 490 495 

Gly Val Val Pro Ala Gly Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 

500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr 

515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 

530 535 540 

Arg Pro Pro Ser Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr 
545 550 555 560 

Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 

565 570 575 

Phe Asn Thr Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 

580 585 590 

His Pro Glu Ala Thr Tyr He Lys Cys Gly Ser Gly Pro Trp Leu Thr 

595 600 605 

Pro Lys Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 

610 615 620 

Thr Val Asn Tyr Ser Thr Phe Lys He Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Met Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 

645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Gin Thr Pro Leu Leu His Ser 

660 665 670 

Thr Thr Glu Trp Ala He Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala 

675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn He Val Asp Val Gin 

690 695 700 

Tyr Met Tyr Gly Leu Ser Pro Ala Leu Thr Gin Tyr He Val Arg Trp 
705 710 715 720 

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 

725 730 735 

Ala Cys Leu Trp Met Leu He Leu Leu Gly Gin Ala Glu Ala Ala Leu 

740 745 750 

Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 

755 760 765 

Phe Leu Tyr Phe Val He Phe Leu Val Ala Ala Trp His He Lys Gly 

770 775 780 

Arg Val Val Pro Leu Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe 
785 790 795 800 

Cys Leu Leu Leu Leu Ala Leu Pro Gin Gin Ala Tyr Ala Tyr Asp Ala 

805 810 815 

Ser Val His Gly Gin Val Gly Ala Ala Leu Leu Val Leu He Thr Leu 
820 825 830 
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1060 1065 1070 

Phe Leu Gly Thr Ser He Ser Gly Val Leu Trp Thr Val Tyr His Gly 

1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gin Met 

1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

Val Thr Arg Asn Ala Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys 

1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly He Phe 

1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser He Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp He Val Thr Arg Ser Pro Thr Phe Ser Asp 

1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu 
1220 1225 1230 
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His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro 
1265 1270 1275 1280 

Asn He Arg Thr Gly Val Arg Thr Val Thr Thr Gly Glu Pro He Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 

1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Val Asp Ala Thr 

1315 1320 1325 

Thr He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn lie Glu Glu Val Ala Leu Gly Gin Glu Gly Glu 

1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala Phe Pro Leu Ser Tyr He Lys Gly Gly 

1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 

1395 1400 1405 

Thr Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser He He Pro Thr Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 

1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro 

1460 1465 1470 

Thr Phe Thr He Thr Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg 

1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg 

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro 

1525 1530 1535 

Val Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gin Ala Thr. Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 

1605 . 1610 1615 

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 
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Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 

1635 1640 1645 

He Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser He He Gly Arg Leu His He Asn Gin Arg Ala 

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu Leu Glu Glu Gly Gin Arg He 

1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser 

1730 1735 1740 

Lys Gin Ala Gin Asp He Gin Pro Ala Val Gin Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

Met Glu Gin Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 

1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 1805 

Thr Thr He Leu Leu Asn He Leu Gly Gly Trp Leu Ala Ser Gin He 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 

1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser Gly Ala Leu Val Ala Phe Lys He 

1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val He Asn Leu Leu Pro 

1875 1880 1885 

Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 

1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 

1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 

1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 

1955 1960 1965 

Thr Glu Asp Cys Pro He Pro Cys Ala Gly Ser Trp Leu Arg Asp Val 

1970 1975 1980 

Trp Asp Trp Val Cys Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe He Ser Cys Gin 

2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly He Met Thr Thr Arg 
2020 2025 2030 
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Cys Pro Cys Gly Ala Asn He Ser Gly Asn Val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg He Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe 

2050 2055 2060 

Pro He Asn Cys Tyr Thr Glu Gly Gin Cys Leu Pro Lys Pro Ala Leu 
2065 2070 2075 2080 

Asn Phe Lys Thr Ala He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 

2085 2090 2095 

Val Thr Gin His Gly Ser Tyr Ala Tyr He Thr Gly Leu Thr Thr Asp 

2100 2105 2110 

Asn Leu Lys Val Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

Val Asp Gly Val Gin He His Arg Ser Ala Pro Thr Pro Lys Pro Phe 

2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Ala Ala Ala Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg 

2210 . 2215 2220 

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly 
2225 2230 . 2235 2240 

Val He Arg He Glu Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu 

2245 2250 2255 

Asp Ser Met Thr Glu Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser 

2260 2265 2270 

Glu Tyr Met Leu Pro Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 

2290 2295 2300 

Asp Tyr Gin Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys 
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Ser Thr He Gly Asp Ala Leu Gin Gin Leu Ala He Lys Ser Phe 

2340 2345 2350 

Gly Gin Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp 

2355 2360 2365 

Ala Ala Asp Ser Gly Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser 

2370 2375 2380 

Glu Thr Gly Ser Thr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 . 2390 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gin Val Glu Leu Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Glu Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 
2420 2425 2430 
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Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu He Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460 

He Asn Ser Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Ser Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 

2485 2490 2495 

Asp Arg Met Gin Val Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp 

2500 2505 2510 

He Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu 

2515 2520 2525 

Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

He Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Gin Thr Pro He 

2565 2570 2575 

Pro Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala 

2580 2585 2590 

Lys Gly Gly Lys Lys Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly 

2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Thr Gin Lys Leu 

2610 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gin Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro 

2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp He Arg Thr Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro 

2675 2680 2685 

Glu Glu Ala Arg Thr Ala He His Ser Leu Thr Glu Arg Leu Tyr Val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Ser Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr He Thr 

2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly He He Ala 

2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Ser Glu Ser 

2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 

2805 2810 2815 

Gly Pro Gin Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 
2820 2825 2830 
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Ser lie Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 

2835 2840 2845 

Ser Trp Leu Gly Asn He He Gin Tyr Ala Pro Thr lie Trp Val Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser He Leu Met Ala Gin Asp Thr 
2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val 

2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala He He Glu Arg Leu His Gly Leu Asp 

2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu He Ser Arg Gly Gly Arg Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp He Tyr His Ser Val Ser Arg 

2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly 

3010 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 7 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 7 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
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uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucii ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaaaag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugecu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
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cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900 
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
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gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau gcugguaugc 6960 
ggcaaugacc uaguagucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020 
agagccuuca cggaggccau gaccagguac ucugccccuc cuggugaucc ccccagaccg 7080 
gaauaugacc uggagcuaau aacauccugu uccucaaaug ugucuguggc guugggcccg 7140 
cggggccgcc gcagauacua ccugaccaga gacccaacca cuccacucgc ccgggcugcc 7200 
ugggaaacag uuagacacuc cccuaucaau ucauggcugg gaaacaucau ccaguaugcu 7260 
ccaaccauau ggguucgcau gguccuaaug acacacuucu ucuccauucu caugguccaa 7320 
gacacccugg accagaaccu caacuuugag auguauggau caguauacuc cgugaauccu 7380 
uuggaccuuc cagccauaau ugagagguua cacgggcuug acgccuuuuc uaugcacaca 7440 
uacucucacc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagggugu ggaagagucg ggcucgcgca gucagggcgu cccucaucuc ccguggaggg 7560 
aaagcggccg uuugcggccg auaucucuuc aauugggcgg ugaagaccaa gcucaaacuc 7620 
acuccauugc cggaggcgcg ccuacuggac uuauccaguu gguucaccgu cggcgccggc 7680 
gggggcgaca uuuuucacag cgugucgcgc gcccgacccc gcucauuacu cuucggccua 7740 
cuccuacuuu ucguaggggu aggccucuuc cuacuccccg cucgguagag cggcacacac 7800 
uagguacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920 
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024 



<210> 8 
<211> 7994 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 8 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcutigccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
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cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaagag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900 
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
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cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau cucagaaagc 6960 
caggggacug aggaggacga gcggaaccug agagccuuca cggaggccau gaccagguac 7020 
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ucugccccuc cuggugaucc ccccagaccg gaauaugacc uggagcuaau aacauccugu 7080 
uccucaaaug ugucuguggc guugggcccg cggggccgcc gcagauacua ccugaccaga 7140 
gacccaacca cuccacucgc ccgggcugcc ugggaaacag uuagacacuc cccuaucaau 7200 
ucauggcugg gaaacaucau ccaguaugcu ccaaccauau ggguucgcau gguccuaaug 7260 
acacacuucu ucuccauucu caugguccaa gacacccugg accagaaccu caacuuugag 7320 
auguauggau caguauacuc cgugaauccu uuggaccuuc cagccauaau ugagagguua 7380 
cacgggcuug acgccuuuuc uaugcacaca uacucucacc acgaacugac gcggguggcu 7440 
ucagcccuca gaaaacuugg ggcgccaccc cucagggugu ggaagagucg ggcucgcgca 7500 
gucagggcgu cccucaucuc ccguggaggg aaagcggccg uuugcggccg auaucucuuc 7560 
aauugggcgg ugaagaccaa gcucaaacuc acuccauugc cggaggcgcg ccuacuggac 7620 
uuauccaguu gguucaccgu cggcgccggc gggggcgaca uuuuucacag cgugucgcgc 7680 
gcccgacccc gcucauuacu cuucggccua cuccuacuuu ucguaggggu aggccucuuc 7740 
cuacuccccg cucgguagag cggcacacac uagguacacu ccauagcuaa cuguuccuuu 7800 
uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu cuuuuuuuuu uuuuucccuc 7860 
uuucuucccu ucucaucuua uucuacuuuc uuucuuggug gcuccaucuu agcccuaguc 7920 
acggcuagcu gugaaagguc cgugagccgc augacugcag agagugccgu aacuggucuc 7980 
ucugcagauc augu 7994 



<210> 9 
<211> 340 
<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 9 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc 340 



<210> 10 
<211> 340 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 10 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
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ugcuugcgag ugccccggga ggucucguag accgugcacc 340 



<210> 11 
<211> 236 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 11 

agcggcacac acuagguaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 60 
uuuuuuuuuu uuuuuuuuuu uucuuuuuuu uuuuuuuccc ucuuucuucc cuucucaucu 120 
uauucuacuu ucuuucuugg uggcuccauc uuagcccuag ucacggcuag cugugaaagg 180 
uccgugagcc gcaugacugc agagagugcc guaacugguc ucucugcaga ucaugu 236 



<210> 12 
<211> 232 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 12 

agcggcacac auuagcuaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 60 
uuuuuuuuuu uuuuuuiicuu uuuuuuuuuu uuucccucuu ucuucccuuc ucaucuuauu 120 
cuacuuucuu ucuugguggc uccaucuuag cccuggucac ggcuagcugu gaaagguccg 180 
ugagccgcau gacugcagag agugccguaa cuggucucuc ugcagaucau gu 232 



<210> 13 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 13 

cgggagagcc atagtgg 17 



<210> 14 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial 
<400> 14 

agtaccacaa ggcctttcg 

<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 15 

ctgcggaacc ggtgagtaca c 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 16 

aacaagatgg attgcacgca 



Sequence: synthetic DNA 

19 

Sequence: synthetic DNA 

21 

Sequence: synthetic DNA 

20 



<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 17 

cgtcaagaag gcgatagaag 20 



<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

ffilE# 2 0 03-3107783 



#g|2 003-329115 



^-v: 63/ 



<400> 18 



gcactctctg cagtcatgcg gctcacggac 



30 



<210> 19 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 19 

cccctgtgag gaactactgt cttcacgc 28 



<210> 20 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<210> 21 
* <211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 21 

ccactcaaag aaaaagtgtg acgagctcgc 30 



<210> 22 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<400> 20 

ccgggagagc catagtggtc tgcg 



24 



<400> 22 

ggcttgggca cggcctga 



18 
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<210> 23 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 23 

gcggtgaaga ccaagctcaa actcactcca 30 



<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 24 

agaacctgcg tgcaatccat c 21 



<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 25 

cccgtcatga gggcgtcggt ggc 23 



<210> 26 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 26 

accagcaacg gtgggcggtt ggtaatc 27 
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<210> 27 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 27 

ggcacgcgac acgctgtg 

<210> 28 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 28 

agctagccgt gactagggct aagatggagc 



<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
DNA (primer) 

<400> 29 

aacaagatgg attgcacgca 



<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
DNA (primer) 



<400> 30 

cgtcaagaag gcgatagaag 
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<210> 31 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 31 

gcactctctg cagtcatgcg gctcacggac 30 



<210> 32 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 32 

cccctgtgag gaactactgt cttcacgc 28 



<210> 33 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: : synthetic DNA 
<400> 33 

ccgggagagc catagtggtc tgcg 24 



<210> 34 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: : synthetic DNA 
<400> 34 

ccactcaaag aaaaagtgtg acgagctcgc 30 



<210> 35 
<211> 18 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
DNA (primer) 

<400> 35 

ggcttgggca cggcctga 18 



<210> 36 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: : synthetic DNA 
<400> 36 

gcggtgaaga ccaagctcaa actcactcca 30 



<210> 37 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthet ic DNA 
<400> 37 

agaacctgcg tgcaatccat c 21 



<210> 38 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthet ic DNA 
<400> 38 

cccgtcatga gggcgtcggt ggc 23 



<210> 39 
<211> 27 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: : synthetic DNA 
<400> 39 

accagcaacg gtgggcggtt ggtaatc 27 



<210> 40 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: : synthetic DNA 
<400> 40 

ggaacgcgac acgctgtg 18 

<210> 41 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: : synthetic DNA 
<400> 41 

agctagccgt gactagggct aagatggagc 30 
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10 20 30 40 50 60 

ACaxsececo aauaggggog ac&cuccscc axjgaaucacu ccccugug&g gaagoactgcj 

70 SO 3Q .3-00 110 120 . 

CUUCACGCAG MftSOGCCOA GOSKJGGQSU . UAGUADGAGU GOOGOACftGC CUOCAGGOCC 

130 140 150 ISO 170 160 

CCXXraOOCG GGftJGftGCCaU ASOGGUCtRSC GGARCCGCTG JU3DACACOGG AAUOGCOGGG 

190 200 210 220 230 250- 

AASACUGGGtf AUAMCCC&C tJCUAXJGCCOG GCCAUUUGGG CGtXSCCCCCG 

250 269 - 270 2&0 390 300 

CAAGACUGCU AQCCGAGUAG CGUUGGGUOG OGAA&GGCOT UGUGGOACUG CCDGA0AGGG 

310 320 330 3fi0 350 360 

CGCOUGCGAG CQCOCOGGGA GG5JCDOGUAG ACCGUGCACC AUGAGC&CAA AUCCUAAAC5C 

3?0 380 390 400 SXG 420 

UCAAAGAAAA AOCAAAftGAA ACACCAACCG UOGCOCfiAUG AUDGAACAAG AOGGAUUGCA 

4B0 440 450 460 470 580 

CGC^GOJUCU CGGGCQGCUO GGGUGGAGAG GCUAOTCGCC UAUGACDGGG CACAACAiGAC 

jgg 500 SlO 520 530. 540 

AAVOGGCOGC UCOGWOGCOG CCGUGUUCCG GCUGUCAGCG CAGGQGCGCC CGGUUCUUUU 

S50 560 570 680 59D 600 

USJCAASACC GACCOGOCOS OUGCCCOGAA tfSAACOGCAG GACGAGGCftG 

£10 620 €30 640. $50 660 

AOGRCGGGOG TXtCC&OGCGC AGCCK3CGCUC GACGOUGUCA CEGAAGOGGG 

670 SSO 6&Q 700 710 720 

AAGGGACUGG CUGCUAUUG6 GCGAAGtRSOC QGQGC&GGMJ CUCCDGUCAU CHCXCCHOGC 

730 TSG 750 760 770 760 

UCOTGCC6AG AJttGOMJCCA UCAUGGCUGA UGGAAUGCGG CGGCDGCAUA OGCUUGADCC 

790 eoo sua $20 aso si© 

GGCmCCOGC CCAUITCGACC ACCAAGCGAA ACAXJOGCAUC GAGCGAGCAC GUAOJCGGAU 

a$0 . StfO S?0 380 §50.. - 9dQ 

CGAAGCCGGP CUUOJCJGAIJC AGGftUGAUCU GGACGAAGAG CAUCAGGGGC UCGCGCCAGC 

510 920 930 $40 950 950 

O6A&C0GUUC GOCftGGCOCA AGGCGCGCAU GCCCGAOGGC GAGGATJCUCG UCGUGAOCCA 

370 980 590 1000 10X0 1§20 

CGGCQAUGCC DGCOtfGCCGfc AVAUGMKGU GGAAAAUGGC CGCUQUUCCC GAUUCAUCGA 

1030 1040 10SO iOSO X070 1030 

CUGUGGCCGG CU3GGUGUGO OSGACCGCOA UCAGGACAUA GCGUUGGCDA , CCOGUGAUAU 

1090 X130 1110 1120 1130 11,40 

UGCUGAAGAG CUOGGOGGCG AA'0O5GCU€A OCGCUUCCUC GUGCUUUftOG GUAUOGCCGC 

1130 U60 1170 118$ U90 1200 

CCCOGfiUlICG CftGCGCftECG OCUUCQAWOS COJUCDUGAC GAGUUCOUCCJ GAGDUUAAAC 

12JL0 1220 . 1230 1250 1250 125$ 

CCOCOCCCUC CCCCCOCCCU AACGUUACOG GCCGMCCCG CUDGGAAUAA G6COG60GDG 

1270 12S0 1290 1300 1310 1320 " 

CGUUUGUCUA WttfGOTAUTO UCSaCCAUAU UGCCGUCUUU UGGCAAOGUG aGGGCCOGGA 

1330 1340 1350 1360 %3f0 1380 

AACCUGGCCC TOGOTUCOUG ACGAOCAUUC OJAGGGGUCU CUCCCCUCOC G0CAMG6AA 
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I^OO 1410 1430 1<H0 

UGCAAGGUCU GtJDGAAUGUC OJGFAGOftAG QP&VOCQUCD GGAACXXJUCU UGAAGACAAA 

1450 1450 1470 1466 1490 1S0G 

CAACGUCUGO AGOfiftOCOOU OGCAGGCAGC GGAACCCCGC AOC0G6OGAC AGGUGCCDCO 

15X0 3520 1530 1530 1550 1560 

CCGGCCAAAA GCCACGUGOA UAAGAUACAC COGCAAAGGC GGCACAAOCC CftGUGCCAOG 

.1S7S 1580 1590 1600. 1610 1620 

UUGUGAG0UG GAOAGUUGUG GAAAGftGOCA ARUGGCUCUC CUCAAGOGUA UUCAACAAGG 

1630 1640 1650 1660 1670 1680 

GGCtiGAAGGft. TJGCCCAGAAG GUACCCCAtfU GUAUGGGAUC OGAUCUGGGG CCtfOGGUGCA 

1690 1700 1.710 1729 1730 1740 

canecummc augugoouag ucgagguoaa AaaAACGucu aggccccccg aaccacgggg 

1750 1760 1770 1780 3790 ' 1300 

AGGUGGUUUtt CCUUUGAAAA ACAOG AUGAU ACCAUGGCOC CCAUCACUGC UUAUGCCCAG 

3.8X0 1&23 1630 1840 1S50 I860 

CttAACACGftG G0CUCC0G6G CGCCAUAGUG GUGAGUAIX3A CGGGGGGUGA CAGGACAGAA 

1870 XS8G 1690 £$00 1310 1920 

G&GGCCX3GGG AftGUOCAAAU CC&X3CDCAC& GOCOOTCAGtf CCXIUCCUOGG AACAACCACJC 

1530 Z*4C 1950 .I960 1970 1980 

UOGGGGGUUq UGOGGACUGO UPACCACGGA GCUGGCAACA AGAOJOJAQC CGGCOUAOGG 

1$9G 2000 2010 " 2020 3030 2040 

GGUOCGGUCA CGCAGAUGOA CtfOGAGUGCU GAGGQGGACU UGGUAGGCOG GCCCAGCCCC 

2050 2G60 2070 20S0 2090 2XpQ 

CCOGGGACCA AqtyCPUDQGA GCCGOGCftAG UGUGGAGOCG GCGfcCCOAUA UCOQGOCACG 

2110 2X20 2130 3140 2150 2X.60 

CGGAACGCUG AUGUCAtDCOC GGOJCGGAGA CGCGQGGACA AGCGGQGAGC ADBGCCttJCC 

2170 2160 2190 2200 2210 2320 

CCGAGACCCA Ot^CGACCtfD GAAGGGGUOC UCGGGGGGGC OGGUGCOOUG CCCUAGGQGC 

2230 2240 2250 2260 2270 2280 

CaOGOCGUUG GGCUCUOCCG AGC&GOT30G UGCUCUCGGG GCGUGGCCAA AXKXAUCGAU 

22$0 2300 2310 2320 ' 2330 2340 

UUCATOCOCG OTGAGACftCU OGAOGUUGOU ACfc&SGDCGC CCACDUUCAG OGACAACAGC 

2350 23$(3 2370 2380 2350 2400 

ACGCCACCGG COQUGCOCCA GACCUAUCAG GUCGGGUAOJ CGCASGCUCC AACUGGCAGU 

242fc 2430 24*0 £450 2460 

<30AAAGASCA CCAAGGCCCC UGUOGCGUAO GCCGCCCAGG GGOACAAAGH ACUAG0GCOU 

2-170 2480 24$0 250<? 2510 2520 

ftACCCCtrOGG UAGOH3CCAC CttXSGGGUUU GGQGCGUACC UAUOCftAGGC ACAUGGCAUC 

2530 2540 2550 2560 2570 25S0 

AAUCCCAACA OUAGGACUGG AGUCAGGAOC GUGAUGACCG GGGAGGCCAU CACGUACCCC 

2530 5600 26X0 2620 2630 2640 

ACAUAUGGCA AAUUOCUCGC CGAUGGGGGC UGCGCUAGCG GOGCCUAUGA CAUCAUCAUA 

2S50- 2660 2670 2680 2550 2700 

EGCGAUGAftU GCSCACGCUGU GGAUGCU&QC UCCAUUCDCG GCAUCGGAAC GGUCCOOGAU 

2710 2720 2730 2740 275D 2760 

CAfiGCPCAGA CAGCCQGGGU CAGACUAACU GUGCUGGCDA CGCCCftCACC OCCCGGGUCA 
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2770 


2780 


27^0 


wuunLniiuvA* 






2830 


£830 


3850 








2890 


2900 


2920 


csy-xFCA/wkGA 




CGAGOTCGCt? 




29&0 


2970. 


GUGGCAUACU 


AUAGAGGGuC/ 


GGACGUCUCC 


3010 


30£p 


3030 


GUGGCCACOG 


ACGCCCUCAU 


GAOGGGGUAC 


3070 


3080 


3090 


AAUGUAGCQG 


UCACOCft&GC 


UGUCGAOTUC 


3130 


3140 


3150 


CfcGACOGOOC 


CACAAGACGC 


UGUCUCACGC 


3190 


32.0b 


3210 


AGACAGGGCA 


C3JUAUAGGUA 


UGUUUCCACt? 


3250 


3260 


3370 


OTAGDGCOUU 


GQGAGOGCOA 


CGACGCAGGG 


3310 


3320 


3330. 


ACCACOGUCA 


GGCuUAGAGC 


GUAOUUGAAC 


3370 


3380 


3300 


COUGAAtJUUU 


GOSAGGCAGO 


UUUCACOGGC 


3430 


3440 


■ 3450 


CAAACAAAGC 


AAGCGGGGGA 


GAACUUCGCG 


34 $0 


3500 


35X0 


GOCAGAGCCA 


AGQCOCCUCC 


OCCGUCCOQG 


3550 


3550 


3570 


AAGCCUACGC 


□DGOGGGCCC 


CACACCUCUC 


3GX0 


3620 


3$30 


GUCAOCCUCA 


CAC&OCCDGG 


GACGAftSUAC 


3670 


3680 


3690 


G0CAUGACCA 


GCflOGUGGGu 


CCUAGCUGGA 


3730 


3740 


'3750 




GAliGOGCUdC! 




3790 


3800 


3810 


GCGCOGGAUA 


AOGAGGuCCU 


GUAUGAGGCtf 


3850 


3860 


3S70 


GOGSCBCu'CA 


UCGAAGAGGG 


GCAGOGGAUA 




3920 


3930 


UDGCGGCAGC 


AGGCCUCUAA 


GCAGGCCCAG 


S97G 


39S0 


3990 


CCCAAAG&GG 


AACAAUTFJOG 


GGOCAGACAC 


4030 


4.04.0 


4050 


000GCAGGAU 




GCCAGGGAAC 


4090 


4100 


4110 


GCEGttCOCA 




GUGGACCAGU 



aeoo 2Bio 382© 

GUAGGCCUCG GGCGGGAGGG UGAGAUCCCC 

2880 SS70 2880 

J02AAG6GAG C<5A6ACft0at C&tmutf O&SC 

29.20 2930 2940 

GCGGCCCOTC GGGGCAUGGG CUUGAAUGCC 

2980 2990 3000 

AUATOAOGAG CUCAGG6AGA UGUGGUGGUC 

3D40 3050 3060 

AGUGGAGACU UUGACaCCGU GAl^CGACDGC 

3100 3210 3120 

AGCCUGGACC CCAOCU0CAC UAUAACCACA 

3360. 3170 3180 

AGUCAGCGCC GCGQG06CAC AGGUAGAGGA 

. 3220 3230 3240 

GGDGAACGAG CCUCAGGAAU GUUUGACAGU 

3£80 3290 3300 

CCCPGOGUQGU ACGAUCUCAC ACCAGCGGAG 

334$' 3350 3360 

AQGCOOGGCC UACCCGUOTG UCAAGACCAU 

3400 3410 3420 

CUCACACACA UAGACGCCCA CUUCCUCUCC 

3450 3470 34 B0 

0ACCUAGCJAG OCUACCAAGC UACGGDGDGC 

3520 3530 3550 

GAOGCCAOGU GGAflGUGOCU GGCO0GAC0C 

3500 3590 3600 

OJGDAOCGUU t5GGGOXlIA£J U&CCAAUGAG 

3640 36Sa 3660' 

AlPCGCCACAU GCADGCAAGC UGAOOJUGAG 

3700 3710 3720 

GGAGUCCUGG CAGCCGUOGC CGCAUAUUGC 

3760 3770 37&0> 

CGCUUGCACG UCAACCAGOG AGUCGUCGUU . 

3820 . 3830 3860 
UUOGAIK3AGA XK5GAGGAACK3 CGCCDCUAGG 

3680 3890 3900 

GCCGAGAUGU UGAAGUCCAA GAUCCAAGGC 

3940 3950 3960 

GACADACAAC CCXOTAUGCA GGCaJUCAUOG 

^3000. *1010 4020 

aogoggaacu ucmkj&gcgg cauocaauac 

. 4050 4070 . 4080 

GCteOGGOte CUUCCAUGAU GGCaUUCAGO 

4120 4130 4140 

AO^ACCAUCC DUCDCAACAD CAUGGGAGGC 
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n$0 4160 4X70 4130 5290 4300 

UGGTOAGCGU COCAGAUOGC ACCAGCOGCG GGGGCCACCG GCUOUGUCGU CAGtfGGQCUG 

4210 4230 4230 4240 4350 4260 

GOGGGGGOTG CCGCJGGGCAG CAUAQGCCOG GGUAAGGUGC UGGUGGACAU CCUGGCAGGA 

4270 4ag0 4290 4300 4310 4330 

OAUGGUGCGG GCAUUUCGGG GGCCCUOGUC GCATOCAAGA UCADGUOJGG GGAGAAGCGC 

4330 4340 435D 4360 4370 43S0 

UOJAOGGAAG AIXSOCAUCAA UCUACUGCCU GGGAUCCUGU CUCOGGGW3C CCUGGDGGI3G 

4390 450Q 4410 4420 • 4430 4450- - 

GGGGDCADCD GOSOGGCCAU UCOGCGOOGC CACGUGGGAC CGGGGGAGGG OGQ3OTOCAA 

4450 4460 4470 4490 4500 

UGGAOGAACA GGOJUAUUGC CUUUGCtfUCC AGaGGAAACC AOGUOGCCCC UAOICACUAC 

45X0 4520 4530 • 454Q 4550 45tfG 

GDGAOGGAGU COGPXJGCGOC GCAGCGDGQG ACCCAAOJAC UUGGCUCUCU UAOJAUAACC 

• . 4570 4S30 4590 4.660 4610 4620 

aGCOaCOCA GAAGACUOCA CAMTJGGAGA ACUGAGGACU GOCOCAtKXC AUGCUCCGGA 

4630 4640 4650 4560 4670 4680 

UCCUGGCUCC GOGACGOGDG GGACUQGGCIJ UGCACCAUCU VGACAGACUU CAAAAAUCJGG 

46$Q 4700 4710 4720 . ^730 4740 

CQGAOCOOTA AAOUGUUOCC CAAGCOGCCC GGCCPCCCCU UCfitraJCUJHS UCAAAAGGGG 

4750 4760 4770 47SB 4?$0 ' 4SC0 

UACAAGGGBG OGUGCSGCCGG CftOXSGCaOC AUGACCAOGC GCUGCCCUSG OGGCGCCAAC 

4810 4820 4830 4840 4850 . 4860 

AJDCUCOGGCA AOGUCCGCCU GGGCUCOADG AGGATJCACAG GGCCUAAAAC OX3CAUGAAC 

4B70 4S8D 4820 4900 4910 4S20 

accuqgcsgg sgacojbuoc oaotiaotgc uacacggagg QGOvgasogc gcogaaacoc 

4930 3940 !^950 4960 5270 49$0 

CCCAGGAACU ACaAGACOGC caocoggagg guggoggccu OGGAQ0AQGC GGAGGOGACG 

4590 5000. 5010 5020 5030 504O 

CAGCMK3GGU CGUAOXXm OGUAACAGGA CUGACCACOG ACAAUCOGAA AAUUCCUUGC 

5050 SC60 5070 50S0 S09Q 5100 

caacuaccuu cuocagagsu uuucdocugg guggaqggog ugcagauoca uagguuugca 

$3.10 S120 5130 5240' 5150 5160 

CO^ACACCAA AGCOGUCtfOD OOGGGABGAG eCOOCGUUCO" GCGIT0GGGCU UAAUUCCUAU 

S270 • S130 . 51^0 5200 $210 5220 

GCUGUOGGGU OOCAGCUuCC OJGUGAACCD GAGCCCGACG CAGACGUAUIJ GAGGUCCAXSG 

5230 5240 5250 5260 5270 5280 

CUAACAGAUC CGCOOCACAO CAOGGCGGAG ACOSCGGGGC GGOGOJCGGC ACGGQGAUCR 

5390 5300 5310 3320 533C 5340 

COTGCAUCUG AGGOGAGCUC CUCAGOGflGC CAGCUAUCAG CACCGUC3GCD GCGGGCCAOC 

5350 £3*0 5370 538C 5320 5400 

UGCACCACCC ACTGCAACAC C0MX3ACGUG GACAUGGUCG ADGCCAACtfJ GCUO^OGGAG 

.5410 5420 5430 5440 3450 5460 

GGCGGUGUG6 CCCAGACAGA GCtDGAGUCe AGGG0GOC0G UOOCGGAOJU nCUOGAGCCA 

5470 54S0 5430 5500 -551$ 5520 • 

AUGGCCGAGG AAGftGAGCGA CCTOGAGCCC UCAAUAOCAO CGGAGUGCAU GCUCOCCAGG 
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. 5530 5540 5550 . 5550* 5570 S58Q 

AGCGGGUUUC CftOGG<XCUtf ACCGQCOTGG GCAOGGOCtfG ACOACRACCC GCDGCOCeU© 

* 5590 5500 5610 ' 5620 5630 5530 

gaaupgugga goaggocaga oimccAAcoG cccwix5Gajix?.=a)GGuuGDsc ucucccccoc 

5050 5660 5670 5690 5690 5700 

GCCAAGA&JGG CCCOGACGCC UCCCCCAAGG AGAOGOCGGA CflGUGGGCCU GAGCGAGAGC 

57X0 5720 5730 5740 5750 5760 

ACCAUAUCAG AAGCCOTCCA GCAACUGGCC AUCAAGACCU CUGGOCftGCC CCCOJCGAGC 

. 5770 5780 5790 5800 5310 5820 

GSUGAUGCAG GCDCGUCCAC GGGGGCG6GC GCOGCOGAAU OOGGOGGOCC GACGDCCCCU 

5830 5840 5350 5860 5670 5880 

GGtXSAGCOGG COOGCUCAGA GACRGGUUOC GCCUCCDCUA UGCCCCCCCO CGAGGGGGAG 

5890 5900 59*0 5930 $950 594Q 

CCUGGAGAUC CGGACCOGGA GUOJGAUCAG GUAGAfiCUOC AACCUCCCCC CCAGGGGGGG 

5950 5960 5970 5900 5990 6CQ0 

GGGGUAGCOC COGGUTJCGGG CUCGGGGUCU DGGUCUACUU GCUCCGAGGA GGAOGAtfACC 

£010 602O • 6030 .6040 6050 £060 

ACOGUSUGCO GCOCCAD&JC AUACOCCDGG AOCGGGGCUC UAAOAACUCC CUGCAGCCCC 

6070 S0S0 6090 6100 6110 6120 

GAAGAGSAAA AGGUGCCAAU CAACCCUUUG AGUAACUCGC OGUUGOGAOA CCADAACAAG 

6130 6140 6150 6160 6270 6180 

GUGUACUGUA CftACAUCAAA CAGOGCCUCA CAQAGGGCUA AAAAGGUAAC UUUOGACAGG 

6190 €200 §210 6.220 6250 6240 

ACGCAAGOGC U05AOGOPCA UUAUGACUCA GUCUUAAAGG ACAUCA&GCO AGCGGCUUCC 

S2S0 6260 6270 6280 6290 6300 

AAOGUCAGOG CAAQGCUOCO GAGGOGTCCC AGWGflCUCC ACCCCAOUCU 

6310 6320 6330 6340 6350 6360 

GCAAGAUCCA AGUAUGGAUO CGGGGCCAAG GAGGUOCGCA GOJUGOCOGG GAGGGCCGUU 

6370 6380 5390 6400 ' 6310 6420 

AAO^CAOCA AGOOCGW5UG GAAGGACOJC CUGGAAGACC CACAAACACC AAOTCCCACA 

643Q 6530 , 6450 6560 6470 6480 

ACCMlCAOGe CCAAAAAUGA «5TOUUOX5C OCftAGQQGGG tmftGAAACCA 

649Q 6500 651.0 6520 6530 6500 

6CUGGOCUCA COGUUUACOC UGACCUOSGC CTCOGGGOCU GCGAGAAAAU GGOCXXTCOAU 

6550 6560 6570 6580 6590 6600 

GfcCfcfcOACSlC AAAAGCOUCC XCAGOCQOTA ABGGfiAGCOd OOJA!5(5e«ttJ CCfcOIACUCC 

6610 6620 6630 6640 6650 6660 

CO*3COCAAC GGGUGGAGOA UCCCUUGAAA GCSUDGGGGG AAAAGAAGGA CCCCAUGGOT 

6670 6580 6690 6700 6710 6720 

UUUuCGUAOG AUACCOGAOG CUUOGACUCA AOOGaCACUG AGAGAGACAU CAGGACOGAG 

6730 6740 S750 6760 6770 6780 

GAGUCCADAU ACCAGGCOTG OKTCtfGCQC GAG6AQ6QCC GCACOGCCAQ ACACCJCGCtlG 

6790 6800 6810 6820 6830 6840 

ACBGAGAGAC U0OTKXS0AGG AGG50CCAUG UUCAACAGCA AGGGUCAAAC CDGGSGUUAC 

SS5G 6860 6870 6880 6890 ' 650(5 

AGACX3UUGCC GQGCCAGOGG GGUGCUAACC ACUAGCAUGG GUMQ&CCW CACAUGCQRU 
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69*0 S920 6930 5940 6950 6950 

GUGAAAGCCC UAGGGGCCUG CAAGGCOGCG GGGAVAGUOG CGCO^CAAU GOPGGUAWC 

€970 6980 . 6990 . -lOOfc 7O10 7020 

GGCGAUGACC UAGUAGUCAU CUCAGAAAGC CAGGGGACOG AGGAGGACGA GCGGAACCOG 

7030 • 7040 7G50 7060 7070 7080 

AGAGCCUUCA 03GAGGOCAU GACCAGGUAC GCOGCOCCUC CCQGDGACCC CCOCAGACCG 

7090 7160 7150 73.30 7140 

GAAUAUGACC UGGAGCUAAU AACAUCCUGU UCCUCfiAAI^S XJGUCUGUOGC GXXSQGCGCG 

7X50 7160 7*70 71$0 2190 7200. 

COGGGCCGCC GCAGAUftCDA COX3AOCAGA GACOCAAOCA CUCCACtfCGC COC23GCUGOC 

7210 7220 7230 72*0 7250 72$0 

UGGGAfiACAfi UURC3ACACOC CCCUAUCAAU DCAUGGCUGG GAAAC&IXSO CCAK1AU2CU 

7370 7380 7399 730D 7310 7320 

CCAAOC&OAU OGGOUCGCWD OGtXCUAAUG AOW^CUUCU UCUCCAUUCU . CAUGGUCCAA 

7330 7340 ?3S0 7M0 7370. 7380 

GACACCCUGG ACCAGAACCU CAACUUUGAG AUSUftXXSGAU CAGUAUACUC CGTCAATJCCU 

7390 7400 7410 7420 7430 7/140'. 

UUGGACCUUC CAGCCAUAAV UGAGAGSOUA CACGGGCUUG AOGCCOOUDC UAUGCaCACA 

7450 7460 7470 7480 7490 7500 

UACUCUCACC ACGAACDGAC GOGGGUGGCU UCAGCCCtfCA GAAAACUOGG GGOGCCACOC 

75X0 7520 7S3Q 7540 7550 7S6Q 

CUCAGGGUGU CGAftGAGUOG cgcucgcgca gucagggogu ccotcaucuc ccguggaggg 

7570 7580 7590 7600 7610 7620 

AAAGCGGCOG COUGCGGCCG AUAUCUCUUC AAUUGGGOGG UGAAGACCA& GCUCMACUC 

7630 764G 7650 766Q ?6?0 7680 

AC0CCAU13QC OGGAGGCGOG COTACUGGAC UyAUCCAGUU GGUUCACCGU CGGCGCCQGC 

7690 7700 • 7710 77.20 7730 7750 

GGGGGCGACA UUUUDCACAG CGDGUCGOGC GCCCGACCCC GCUCAUUACO CUUCGGCCUA 

. 7750 /760 7770 . . 1780 7790 7800. 

CDCCUACUUU UCGDAGGGGU AGGCOJORJC CUACUCCCOG CtfCGGEAGAG CGCC&CACAC 

7810 7S20 7830 7840 7850 7860 

UAGGUACACtT CCftUAGCUAA COGODCCUUlI UUUUITOUOUU UOUUUUUOUO DUUUUUXStJUU 

7870 7880 7890 7900 7910 7920 

CUOOOTUUUU CCUUUUUOUU TJUOTUCCCUC UOTCOCOCOJ VOJCmCXJUA UDCUA030UC 

7930 7940 7950 79$0 7970 7980 

OOCCOUQGOG GCOCCAUCUO &GCCCUAOTC ACGGCUAGCU GUGAAAQGUC CGUGAGCOGC 

7990 8000 8010 3020 &Q30 8040 

AUGACUCCAG AGAGUGCCGU AaCOGGOCUC UCUGCAGAUC AUOT 
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20 20 30 40 50 60 

AcecGCcdcu aatogggocg acacuccgcc aix^^cu ccccuswsaG gaaojacugu 

-.70 SO 90 . . WO 110 120 

CTOCAOCtCAG AAAGCGUCUA GCCAUGGOGU CAGOAUGAGU GOOGUACAGC CUCCAGGCCC 

iaO 140 150 156 170 180' 

oecceuccco <3g&gagccau aguggucugc ggaacgqgog agoacacxdqg aaijugooggg 

190 200 210 • '220 230 240 

AAGACOGGGG CCUUUCUDGG flDAAACOCAC UCUA1K5C005 GCCAUUUGGG CGUGCCGCOG 

250 260 270 280 . 290 300 

CfcAGACDGCU AGCCGftGUAG CSUOGGGUUG OGAAAGGOCU UGUGGOACDG 0OJ6AU&QGG 

310 320 330 340 ' 350 360 

XJGCUDGCGAG UGCCCCGGGA G6UCUCGUAG ACCGUGCACC AUGAGCACftA AUCCCAAACC 

370 380 WO 40O 410 420 

DCAAAGAAAA ACCAAAAGAA ACACUAAC06 OGCCC&AUG AUUGAACAAG ATX3GAUOOCA 

430 440 4S0- -460 .470 4$0 

OGCAGGUUCO COGGCCGCOU GGGUGGftGAG GCUAUUCGGG UAUGACUGGG CACAACAGAC 

490 -500 510 520 530 540 

AftUOGGCUGC UOJGAUGCCQ CCGDSUUCCG GQJGUCAGCG CAGGGGCGCC CGGUUCUUOU 

r ■ . * * • *' . 

S50 £60 : 370 5S0 590 500 

t)6tK3A<SA0C OAOCUGUCOG GUGCCCUGAA UGAAOJGCAG GACGAGGCAG OGCGGOJAUC 

610 620 630 540 650 5€0 

GOGGCUGGOC AGGACGGGCG UUCCOTGCGC AGCUGtSGOIC <3AO3U0(3UCA COGAAGCGGG 

670 6$0 690 700 710 720 

AftGGGACUGG COGCUAOUGG GCGAaGUGCC CQGGCAGGAU OTCCUGUCAU COCACOHJGC 

730 74.0 75G 760 770 7£0 

XJCCUGCC6AG AAftGUAUCCA UCADGGCOGA UGCAAGGCGG CGGCUGCAOA CGOTOGAUCC ■ 

790 " 800 330 930 ' 340 

GGOIACCUGC CCMJUOGACC ACCAAGOGAA ACAUCGCAUC CAGCGAGCAC GOACUOGGAU 

550 860 " 870 . S80 690 500 • 

GGAftGCCGGU COUGttoGATIC AGGAUGAUOJ GGAOGAAGAG CAUCW3GQGC UCGCGGCftGC 

910 930 $5$ J>40 9S0 960 

CGAACIfqnOC GCCAGGCUCA RGGCGCGCAU GCCOGftCGGC GAGGACJCOCG UCGDGACCCA 

9f70 9S0 990 100Q 101O 1020 

UGGOGAtKSCC tJGCmiGCCGA AT2&8C8UGGCJ GGAAAftUGGC CGOIUUUOJG GAUOCAUOGA 

1030 1040 10SO 10SQ . 107O 1080 

O3GDGGC0GG CO&GGCGUGG OGGACCGCUA UCAjQGACAUA GCGUBGGOIA CX33GOGMJAU 

3090 1100 1X10 1120 3.130 1140 

GGCDGAAGAG COOGGCGGCG ft&UGQGCUGA CCX5CUUCOTC GDGCUOUAOG GUSOOGCOSC 

1150 1160 1170 1180 1190 1200 

UCCCGSUUCG CftGCGCAtlCG CCUUCUAUCG CCUUCUUGAC GAGOTCUtfOF GAGOUUAAAC 

1210 1S20 1230 1240. 1250. 1260 

cojcucgcuc caceecoGra . aacgudacug gocgaagccg couggaauaa ggccggugug 

1270 1280 1290 J300 1310 - 1320 

CGUUUGUCUA mUGOTAUUU UCCACCAUAB DH3GGUCU0U UCSGCAAtJGUG AGGGCCCGGA 

1330 1340 1350 13SQ - B70 " 1330 

ARCCUGGCOC WUCT0OT33 ACGAGCATOC COAGGGGUGJ UDCCCCUGUe GCCAAAGGAA 
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1390 1400 1410 1430 1430 

UGCAmGSUCu GUUGAAUGPC €UG?^WGAAG CAGUUCCUCU CGMGCDUCU UGAAGACAAA. 

1450 14*0 1/570 1480 1490 1500 

CAAOGUCOGO AGCGACCCOD tK^GGCAGC GQUVCSCCCOC ACCUGGOGAC AGG06COJCO 

1510 1520 1530 1546 1550 1560 

GCGGCCAAAA GCCAO6CG0A UAflGaU&CAC COGCftAftGGC GGCftCWVCCC CftGOGCCAOS 

2570 1580 15S0 1600 1610 1620 

OUGUSAGUUG GAUAGUBGUG GAAAGAGUCA AAUGGOXUC CUCAAGCdOA 1KJCAACAA<5£ 

1530 15-10 1650 1660 1670 1680 

GGCUGAAGGA. OGCCCAGAAG GUACCCCAUU GOAUGGGA0C OGSUCDGQGG COJCGGUGCA 

1650 1700 1710 1720 1730 1740 

CAUGCDUUAC AJUGOGUOUAG UCGAGGUUAA AAAAACGUCO AGGCCCCCCG AACCADQGGG 

1750 1760 1T70 17«0 1790 1300 

ACGUGGUUUD OCBUOQAAAA ACAOGftDtRAD JKX3KJGGCOC CCABCACGGC OUAOGCCCAG 

1810 1820 1830 1S40 1BS0 I860 

CAGACACGAG GUCUCUIX3GG CUCUAUAGUG CTGAGCAOGA CGGGGCGUSA CAAGACAGAA 

ie7o ia$0 i$90 i»oo 1910 132.0 

CAGGCCGGGG A3S0CCAAGU CXSPOTCCfcCfc CtK^COCAOT CCOTCOIOGG AACAUCCAUU 

1930 1940 1950 1360 1970 1980 

UCQGGGGBCO OAtXSG&CUGO OUACCftCGGA GCDGGCftACk J^CACUAGC CGGCUCGCGG 

1990 2000 2010 2020 203C* 2040 

GGCCXX3GUCA CGCftGAUGtfA CUCGAGCGCC -GAGGGGGACU DQGUCGGGOG GOCCAGCCCO 

2050 2060 2070 2080 2090 2100 

COJGGGACCA AAUCUtfUGGA GCCG0GUACG OGUGGAGCQG UOGACCUODA UUOGGUCACG 

2\%Q 3120 2130 2140 2150 . 2160 

CGGAAOGCOG MGCCADCCC GGCUOGAAGA CGCGGGGACA &GCX3GGGAGC CCOGCOCUOC 

2170 2180 219Q 2200 2210 2220 

CCGAGACCCC UISXXS&COJO GAAGGGGUCC T20GGGGGGAC CCCO&GGGGC 

2230 2240 22&Q 2260 2270 2280 

C&GGCUGUCG GAAUCDUCOG GQCfiGCUOOG UGCUCUCGGG GUGDGGCUAA GUCCAUAGAU 

2290 2300 33XG 3320 2330 .2340- 

UUCAUCCCOG DDGAGAOGCU 03ACAOCGOC AOGCGGUCtfC CCACCUUURG UGACAACAGC 

2350 2360 2370 2380 2390 2400 

ACACCAOCAG COGIX5CCCCA GACCUAtlCAG G0QGQGUACO UQCAO9CC0C CACvOGC&GU 

2410 2330 2430 2440 2450 2£60 

GGAAAAAGCA ccaaogococ cgocgoguac gcoggccagg gotauaaago gcoggugcoc 

2470 2480 2490 25 Q0 2510 2520 

AAUCCCUCGG UGGCUGCCAC CCUGGGAU0U OGGGCGUAC0 UGOCCAACGC ACAtfGGCftOC 

2530 2540 2550 2560 2570 * 2580 

AACCCCAACA UUAGGACUGG AGCCAGAACU GOGAOGACOG GGGAGCCCAO UACADACUCC 

2590 2600 2&tf) 2620 2630 5^40 

ACGUfctfGSUA AAUUCCUCGC CGAUGGQGGC IX30GCAGGOG G0GCCUAUGA CAUCAUCAUA 

2650 2660 2670 2680 - 2630 2700 

UGOGAUGATO GCCfcOCOGO CG^UGCJACC GCA0C6GGAC AGuCCUUGAC 

2710- 3720 2730 2740 2750 2760 

CAAGCAG&GA CftGCOGGGOT CAGGCUAftCO OTAOJGGCC& OGGCC&CGCC CCCCX3GGUCG 
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2770 2760 2790 2800 2840 2820 

GOGACAACOC COCAUCCCAA UATFAG&GGAfi GOAGCCO/QS GACAGGAGGG UGAGAWCCOC 

- 2830 2840 2850 2660 2810 2880 

UDCOAtJGGGA GGGCGUUUCC CCUGtJCUUAC AUCAA<3GGAG GGAGGCAOJU GAUUUUCHGC 

2890 2900 . 2910 2920 2930 29*10 

t^CUCAAAGA AAAAGUGUGA OGAGCUOGCA ACGGCCCU0C GGGGCAUGGG C0UGMCGCU 

2950 2960 2970 2980 2990 . 3G.00 

GUGGCAUAUO ACAGAGGGUD GGACGUCDCC AUAAUACCAA COCAAGGAGA IXSOGCUGGUC 

303.<J 3020 3030 3040 5050 30$0 

OTUGCCACCG AOSCCOOCAO GACGGGGOAD ACUGGAGAOJ OUGACUCOGO GRUOGACOGC 

3070 3080 3090 3*00 3010 3120 

AACGOAGCOS UCACCCACGC CGOAGACBOC ASOOX3GACC CK2ACCTUCAC UADAAOCACA 

3130 314Q 3X50 . ' 31$C 3X70 31.80 

CAGACUGUCC OGCAAGACGC UGXICtJCteGU AGOCAGCG0C GAGQGOQCAC CGGOAGAGGA 

3190 3200 3210 3220 3230 35*0 

AGA03GGGCA UUUAUAGGUA UGUUUCCACU GGOGAGOGAG CCOCAGGAAU GUUUGACAG0 

32S0 3560 32.70 3280 3290 3300 

, GUAGUACOCU GUGAGUGOJA, OGACGCAGGA QCOGCOUGGU AOGAGCtJCOC ACGAGUGGAG 

3310 3320 3330 3340 3350 3350 

ACGACOGOCA QGOJCAGGGC GUAUITJCAAC AOGCCOGGCD DGCCDGUGUG CCAGGAOCAC. 

3370 . 3360 3390 3400 3410 .* 3420 

CUUGAGUUOT GGGAGGCACU 0UUCAOOGGC COCACACACA TOGAOGGDCA UOQCCOUOCC 

3430 3440 3450 3460 3470 3480 

C3GACAAACC AGuCGCGGGA AAAUUTOGCA OACUUAGOAG COJAUCAOGC GACAGDG^GC 

3490 3500 3510 3530 3530 3540 

GCCAGGGOCA AftGOGCCXXC CCCGtJCCtfGG GACGUCAUOT GGAAGUGCOU GACUOGACOC 

3550 3550 357G 3580 3590 3600 

AAGCCCAOGC BUeOGGGOOC UACAOCUCUC O3GUACCG0U IX3GGCUCUGU UACCAACGAG 

3530 3620 3630 £640 3650 . 3€6G- 

GUCACCCUUA CACACCCOGU GACAAAAtfAC AUCGCCACAU GCAUGCAAGC UGACCDOGAC 

3570 36&0 ' 3690 3700 3710 3720 

GUCAOGACCA GCAOGUGGGU OCUGQOJGQG GGAGUOJUAG CAGCCGUOGC OSCGUAUDGC 

3730 " 3740 '3700 3760 3770 3780 

UUAGOGACG0 GGUGUGUDCC CAUCAUUGGC CGUUUACACA UCAACCAGCG AGCUGUOGUC 

3790 3S00 3810 3820 3830 3840 

GOJCOG6ACA AGGAGGUCCQ COBBGAGGCO UU0GAUGAGA UGGAGGAAXJG U6CCDCCAC3A 

3850 3850 3870 3880 .3890 3900 

GCGGCUCDCC GCAfiCGGAUA GCCGAGAIKSC UGAAGUCCfcA GAUCCAAGGC 

3910 3920 3930 * 3940 3950 3960 

UDADUGCAGC AAGCCOOTAA ACAGGCQCAG GACAUACAAC CCGCUGUGCA AGCUUOC5UGG 

3970 3980 3990 4000 4010 4020 

CCCftAGAUGQ AGCAAUUOK3 GGCCAAACAU AUGOGCAACU UCAUAAG0G0 GAUUCAGUAC 

4030 4G4fO 4050 4060 5070 4080 

C0O3CAGGAC IK?JCAAGACU GCCAGGGAAC COHSCDGUQG COUCCAUGAU GOCAUUCAGC 

4090 ' 4100 . 4X10 4130 4340 

GCCGCCCuCA CCAQUCCGUU QOCAACUAGC ACCACCAUCC UUCUUAACA0 
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d?SG 4X60 4170 4130 4190 4300 

UQQCOGGGGO OCCfcMtfUGC GCCACCCGGG GGGGCCAOJG GCUUUGUUGU CAGUGGCCIX5 

4210 *220 - ^30 4240 4250 4260. 

GDGGGAGOXS CDGUUSGCAfc CAUftGGCOUG GGtf AAAGUGC UGGUGGACAU OCUGGCftGGG 

4270 4230 4230 4300 431$ 4320 

UAOGGUGOGG GCAUUUCEG3 GGCCCUOGUC GCGUUUAAGfi UCfiUWCUGG CGAGAAGCCC 

4330 4340 4£S0 4360. 4370 4380 

UCCAUGGAQG AJ0GUOKJCAA CDUGCOGOCU GGGADUCDGU CUCCAGGUGC UOK3GIX3GUG 

439& 4.400 44*0 4420 4430 4440 

GGAGOCaDCU GOGOGGCCAU UCUGCOCCGC CRUGtX3GG&C OGQGGGAAGG CGCGGUCCAA 

4450 4460 447G 44&Q 4490 45Q0 

OSGAUGAACA GGQJU7VUOGC CUUCGCUDCC AGftGGAAACC ACGOCGOOOC UACUCACUAC 

4510 4520 .4530 4540 45-SG 4560 

GUGAGGGftGO OGGAUGOGUC GCAGOGUGUC ACCCAACW3C UTCGGCtfCUCO CACUMOAACO 

•4570 4S90 4600 4610 £6&0 

AGUCUACttCA GCAGACUUCA CAACUGGADC ACOGAGGATO GCCCCAt^CC AUGOGOCGGC 

4630 4640 4650 4660 4670 4680 

. UOGUGGCPCC GOGAOGOGUG GGACX3QGGUC tJSORCCftUCC UAIO^AGOU UAAGAACUGG 

463Q 4700 4720 4720 - 4730 4740 

OJGACCUGCA AGOXSUOCOC AAAGAXXSCCU GGGOJCCCCU UUADCDCTOG OCAAAAGGGG 

4750 #760 4770 4780 4790 4860 

UACAAGGGOG DGtXSGGGCGG CACUGGCADC ADGACCAC&C GAUGCCQCUG CGGOGCCAAC 

48X0 4320 4830 4840 4£5<X 4860 

AUCUCUGGCA ACGUCCGCUtf GGGCUCUAUG AGAAUCftCAG GACCCAAAAC COGCAUGAAC 

4a?0 4380 4890 4900 . 4510 4920 

AOCOGGC&GG GGACCUTOOC UADCAHJOGU UAUACAGAAG GCCAGUGCUtT GCCGAAACCC 

4930 4540 43JS0 4960 4570 4980 

GCGUUAAACtf OCAAGAOCGC CAUCUGGAGA OTGGCGGCCD CAGAGOAOGC GGAAGOGAOG 

4550 5000 30X0 5020 5030 5040 

CAGCACGGAU C&UAUGOCUA UAUAACA<3GG COGAOCACUG ACAACUUAAA AGUOOCOOQC 

S050 $060 .51370 5050 5090 SXQQ 

C^OJOCCCtf CUCCAGAGUtf WUCpCOyG<? CUGGAOGGAG UAO^UOCA UAGGUCCGOC 

5110 5120 5130 * 5140 5X50 5160 

CCCACACCAA AGOCGUUUUU COGQGAUGAG GUC0OGUUCR GOGOUGQGCU CAAUUCAUtTO- 

5X70 S160 5X90 5200. 5210 522G 

G5JQGUOGGGU CUCAGCUUCC CUGPGACOCd GAGCCOGACA CGGAGGUAG0 GAUGOOCHDG 

5230 5240 5250 5260 S270* 5230 

CUAACAGACC CAUCCCOTAD CACGGCGGAG GCUGCAGCGC GGCGUWAGC qQSGGGGOCA 

5290 5300 53X0 5320 - 5330 5340 

CCCCCAUCDG AGGCAAGCUC CUCAGOGAGC CAGCUGUCGG CGCCAOCGOT GCGAGCCACC 

5350 5360 .5370 5350 5390. : 5400 

t^CACCACCC AOGGOAGGAC CUAUGAUGUG GACAUGGOGG AUGCCAACCU GUUCAUGGGG 

5«nO 5*24) 5430 5440 5450 5460 

GGOGGCG0GA UUCGGAUAGA GUCUGAGtJCC MftGUGGqCG OUCUGGAOJC OCUOGAOfCA 

54?0 54BO 5490 5500 ■ 5520 

ATOACCGAGG AAGAGGGOGA CCUUGAGCCtf 0CAGUACCAU CQGAGUAOAO GCtJCCOCftGG 
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5530 55^0 5550 5560 3570 5550 

AAGAGGUUCC CACOGGCCOU ACCGGCUUGG GGGCGGGCOG AUUACAACCC ^CCGOJUGOG 

5590 5500 55*0 $620 5330 5640 

OAAUGGUGGA AGW3GOCAGA UUMJCAACCA COCftCOGUUG CGGGCUGOSC UCOCCCCCCC 

5650 5660 5670 5690 5700 

CCCAMAAGA CCXX3GAOGCC DCCDCCftAGG AGAGGCGSGA CAGOGGGOCO GAGOGfsGAGC 

5710 5720 5730 5740 S7S0 5760 

ACCAOTGGAG AUGCCCUCCA ACAGCUGGCC AUCAAGUCCtf UOGGOCAGCC CCCOCCAAGC 

5770 5750 5500 5S1.0 5S20 

GGCGMJUCAG GCOJUUCCAC GGGGGOGG&C GCCGCoSaCQ CCX3GOGAUOG GACACCCCCU 

5B3Q 5850 5850 5860 5870 5880 

CAOGAGTOGG CUCUUUCGGA GACAGGUUCU ACCUCCCCCA UGCCCOCOCTJ CG2GGGGGAG 

5890 5900 5910 5920 5930 .594.0 

0CUGGGC3RCC CftGACOTGGA GOCUGAGCAG OUAGAGCUUC AACCttQCUCC CCAGGGGGGG 

S9SQ 5960 5970 5980 S990 6000 

GAGGCAGCUC CCGGCOQGGA CUCGGGGUCC DGGUCOACUU GCBOCGAGGA GGADGACOCC 

6010 €020 5030 60S0 S050 .6060 

g ga g OGq OCP GCUCCADGUC AUATJUCCDGG AOGGGGGCUC UAAUAACUCC TKXSOAGCCCC 

6070 6080 $090 6100 cilO 6130 

GAAGAGGAAA AGUOGCCAAU UAAOUOCUlfG AGCAACUCGC DGUUGOGAUA CCAUMC&AG 

6130 6140 61SQ 63.60 6X70 6180 

GUAUACOGUA CUACAUCAAA CAGUGOCtfCA COAAGGGCUA AAAAGGUAAC UUDUGAUAGG 

6190 6200 6210 5220 6230 ' 6240 

AUGCAAGUGC UCGA03CCUA UUAUGAUUCA GUCOUAAAGG ACAUCAAGCU AGCGGCCUCC 

6250 6260 6270 6280 6290 5300 

AAGGUCAGOS CAAGGCUCCO CACCDUAGAC GAGGCGUGOC AACJUGAGOCC ACCCCAC0CU 

6310 6320 6330 6340 6350 -6360 

GCAAGAOOCA AGDAUGGGUO UGGGGCUAAG GAGGUCCQCA GOTUGUO0GG GAGGGCOGUC 

6370 6360 6390 6400 6410 6420 

AAOCACBUCA AGUCCGUGUG GAAGGACCUC UUGGAAGACU CACAAACACC AAOTCCUACA 

5430 64*0 6450 6460 6370 6430 

ACCAUCAUGG CC&AAAAUGA GGOGOUCUGC OTGGACOCOS CCAAGGGGGG UAAAAAACCA 

6490 6500 6510 6520 6530 6540 

GCUOGCCUUA UCG0OOACCC OGACCOCGCC GGCAJQGG0OT GCGA6AAGAU QGCCCUUUAU 

5550 6560 6S70 ' '6580 6590 6600 

GAOGUCACAC AAAAGCOUCC ttCAGGOGGUG AUGGGGGCUCf CTOAtfGGCOT <XAGOAjCOCC 

6610 5620 6S3G 6*40 6650 6660 

CCCGCOCAGC GGGUGGAGOij OCUCTOGAAG GCAUGGGCGG AAAAGAGAGA COCUAXJGGGU 

6670 6680 5590 .6700 6710 6730 

UOKJOGUAUG AtfACCCGAUG CUIKK3ACUCA ACOGUCACUG AGAGAGACAU CAGGACOSftC 

S730 6740 6730 6750 6770 €7a0 

GftGXJCCWUKU ACCAGGCOKS OJCCUUAOCC GAGGAGGCOC GAAOOGiXAU ACACUCGCUG 

6790 6800 6010 S$20 5830. 6&40 

ACUGAGAGAC tHttAUGOGGG AGGGOCCR30G UUCAACAGCA AGGGCCAGUC CGGCG&GOAC 

6850 6860 £870 6860 6890 69Q0 

AGGCGUUGCC GCGCCAGOSG GGUGCUUACC ACUAGUBOGG GGAACACCAU CACA0GC0M? 
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692G 6020 &930 6930 6950 5950 

eUAAAAGCCC UAGCGGOJUG CAAGGCDGCG GGGAO&ATJUG CGCCCACGAU GCUGGOADGC 

6270 6980 6$9& 7000 7010 7020 

QSCGACG&CU OGGUCGUCAU ClfCAGAAAGC CAGGGGACUG AOGAGGAC38A GCGGAACCUG 

7030 7CW0 7050 7060 7070 7080 

AGAGCCt3UO\ CGGAGGCUAU OACCAQC5UAU DCUGCSOCCUC CUGGUGAC0C OCCCAGAO0G 

• 7090 7100 7110 7i20 7130 ?1<30 

G^UADGftOC OGGAGCUAAU AACAUCUUCT UCCUCAAAOG PGUCDGOOGC ACDUGGCOCA 

7150 7150 7170 7.180 '. 7a90 7200 

cafiGGCCQCC GC2W5AOftCUA COJGACCAGA GACCCCACCA CUUCAAUOGC CCGGGOJGCC 

721.0 7220 7230 7240 7250 72$0 

PGQ(?AAACAG CJOAGACACfJC CCXt^OCAW UCAGGGOJGG GAAACAUGftXJ CXTAGOAOGCO 

7^70 7280 7290 7300 7310 7320 

CX^UOCADaO ©SGUUCGCAU CGOCCUGAtJG ACACACUUCU OttJCCAUUCU CAIJGGCCCAG 

7330 7340 73S0 7360 7370 73B0 

GACAOCCOAG ACCAGAACCIX TJAACUUI3GAA ADGUAOGGftO CGGOGOACUC CGUGAGUCCQ 

7390 7400 7410 7420 7430 7440 

CUGGAOCUOC CftGCCMJAAU UQAAAGGUUA CAOGGGCOUG AOSOCUUOIC UCIX3CACACA 

74SO 74*50 7470 7.feo 7490 7.500 

DACaCUCCCC AOGAACtfeAC <30GGGDGGOJ tXttGCOOJCA GAAAaCtttfGG GGOGOCACCC 

7510 7520 -7S30 7540- 7550 7560 

CUCftGAGGGU GGAAGAGCCG GGCGOGUGCA GUUAGGGCOT COCUCAU05C C0GUGGGGGG 

7570 7SS0 7590 7600 7510 7620 

AGGGOGGCCG UUUGCGGDCG qUAOOJCUDC AACUGGGCGG UGAAGACOVA GCUCAAACUC 

7630 7640 7650 76S0 7570 7680 

ACUCCUUUGC OGGAGGCACG CCUCCDGGAU UUGUOCAGUU GGUUUACX^SU OGGCGCOGGC 

7690 7700 7710 7720 7730 7740 

GGGGGOGACA OTUADCACAG CGGSOOGOGU GCCCGACCCC GCCUAUUACU CCUUAGCCUA 

.7750 7760 7770 ' 7780 7750 7000 

CUOCUACUUO CTOUAGGGGO AGGCCUCUUC CUACIXXXCG CUCXSXit3AGAG . CGGCACACAU 

7810 7820 7B30 7850 7850 7860 . 

UAGCUACACU CCAUftGCUBA CUGUUCCUUU UUUUUUODUD UUOUUUUXSlU UUUUUOTUUU ' 

7870 78§0 78S& 7900 7910 ?920 

DDUUOTUUUU CUUOUUUUUU OTUUUOQOTC UODGUDCOCD SSCTJCAUCUUA UUCOACUOUC 

7930 7940 795D 7950 7970 7980 

UU0CUDGGDG GCUCCAOCTO AGCCCUAGUC AOGGCUAGCU GUGAAAGGOC CGUGAGCOGC 

7990 9000 8010 8020 $030 8040. 

ADGACTGCAC? AGAOK3CCSQ AftCUGGOCDC UCUGCAGAUC AttGO 
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